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The present invention relates to new expressions systems, 
and in particular to expression systems in which a gene of 
interest is expressed at an optimal level. Particular 

examnlps of enrh o^r>-y-==-^-; .-.v. i ___ 

- - -i-- «^<= iciiuvirai packaging 

cell lines and a number of preferred cell lines have been 

identified. 



The ability of eukaryotic and prokaryotic ribosomes to 
reinitiate translation at an internal start codon within an 
mRNA sequence has previously been recognised. Studies have 
been reported in which the efficiency of the process, which 
is generally regarded as being low, has been connected with 
the length of the interciscronic sequence (Kozak (1987) 
Mol. Cell Biol. 7, 3438-3445). Selection of this sequence 

or spacer as 70bp in length, and containing no other start 
codons, has been previously reported as being optimal for 
20 reinitiation in a eukaryotic cell line (Cosset F-L. , 

Virology (1991) 185, 862) . 

The applicants have found a way in which the inefficiency 
associated with the translation reinitiation process can be 
25 used to good effect. 

According to the present invention there is provided a 
recombinant expression vector comprising a gene of interest 
and a selectable marker gene, wherein the selectable marker 

30 gene is arranged downstream of the gene of interest and a 

stop codon associated with the gene of interest is spaced 
from a start codon of said selectable marker gene at a 
distance which is sufficient to ensure that translation re- 
initiation is required before said selectable marker protein 

3 5 is expressed from the corresponding mRNA . 
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The invention further provides a process for producing cell 
lines xn which a gene of interest is expressed, which 
process comprises transf orming host cells with an excression 
vector comprising said gene of interest and a selectable 
narker gene, wherein the selectable marker gene is arranged 
downstream of the gene of interest and a stop codon - 
associated with the gene of interest is spaced from a start 
codon of said selectable marker gene at a distance which is 
sufficient to ensure that translation re-initiation is 
required before said selectable marker protein is expressed 
from the corresponding mRNA, and selecting those cells where 
expression of the selectable marker gene may be detected. 

Since re-initiation of translation is a relatively 
inefficient process, this means that the selectable 
protein will be expressed at lower levels than the product 
of the gene of interest. When the marker protein is 
expressed at detectable levels, the gene of interest will be 
expressed at higher levels. This will ensure that during 
the subsequent selection procedure, only those cell clones 
which express the gene of interest at higher or optimal 
levels will surviv e. Low expressing clones will be 
eliminated by the selection process. 

Cells transformed with the above -described expression 
vectors form a further aspect of the invention. 

The host cells are suitably eukaryotic or prokaryotic host 
cells, preferably eukaryotic host cells. 

The number of nucleotides in the space between the stop 
codon of the gene of interest and the start codon of the 
selectable marker will suitably be in the range of from 20- 
200 nucleotides, preferably from 60-80 nucleotides, even 



more preferably 70-80 nucleotides 



WO 97/08330 



PCT/GB96/0206I 



3 

The vectors used in the process of the invention may be ary 
of the known types, for example expression plasmids or viral 



vectors . 



Selected cells may be cultured and if required, the' orotein 
product of the gene of interest isolated from the culture 

usina convent •> 1 t-s.-v.^-:.-.-.-. -- - - 

~ J . t .c=.L-iiciL..Lvexy y expression of 

the gene of interest may result in other desired effects, 

for example, where the gene of interest is included as part 

of a viral packaging construct . 

Some experimental and clinical gene transfer protocols 
require the design of gene transfer vectors suitable for in 
vivo gene delivery (Miller, A.D. 1992. Nature 357:455- 
460) . Retroviral vectors are attractive candidates for 
such applications, because they can provide stable gene 
transfer and expression (Samarut J. et al.,Meth. Enzymol . in 
press) and because packaging cells have been designed which 
produce non-replication competent viruses (Miller A.D (1990) 
Hum Gene Ther . 1 5-14) . However currently available 
recombinant retroviruses suffer from a number of drawbacks. 

Packaging cell lines provide in trans the retroviral 
proteins encoded by the gag., pjol, and env genes required to 
obtain infectious retroviral particles. The gag; and pol 
products are respectively the structural components of the 
virion cores and the replication machinery (enzymes) of the 
retroviral particles whereas the env products are envelope 
proteins responsible for the host -range of the virions and 
for the initiation of infection and for sensitivity to 
humoral factors. An ideal packaging cell line should 
produce retroviruses that only contain the retroviral vector 
genome, and absolutely no replication-competent genomes or 
defective genomes encoding some of the viral structural 
genes . 
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A number of packaging cell lines designed for human gene 
transfer have been designed in the past by introducing 
plasmid DNAs which contain "helper genomes" encoding gag, 
pol and/or env genes into cells. 

Retroviral packaging cell lines are cells that have been 
engineered to provide in trans all the functions required to 
express infectious retroviral vectors. A helper genome (or 
construct or unit), is herein also referred to as 
"retroviral packaging construct (or unit)" or "packaging- 
deficient construct (or genome unit)" or "ga.g-pol/env 
expression plasmids" . 

Much efforts has been made to design strategies to optimize 
the helper-genomes in order (i) to get the highest 
production of retroviral packaging functions (which 
correlates which infection titers of retroviral particles) 
and (ii) to minimise the chance that the helper genome can 
be transmitted via the viral particles (which may lead to 
emergence of unwanted retroviral forms) . 

The first of these packaging cell lines used full length 
retroviral genomes as helper genomes that had been crippled 
for important cis-regulated replicative functions (reviewed 
in Miller, Hum. Gene. Ther . 1:5-14 1990). In order to reduce 
the possibility of occurrence of replication-competent 
viruses and of transfer of virus structural genes, a second 
generation of safer packaging cell lines has been designed 
by using two separate and complementary helper genomes which 
express either gag-pol or env and are packaging-deficient 
(Miller supra) . 

The cells into which these helper genomes were introduced 
were isolated by cotransf ecting them with plasmids encoding 
selectable markers. However, as no selection was applied on 
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the packaging-deficient retroviral geno.e itself, the' helper 
functions can be lost during the passages of the cells in 
culture and the current packaging systems provide l^ted 
titers of infectious retroviral vectors, usually only of the 
order of io»-io« infectious units :.uM. Indeed the 
cotransfection with a plasmid encoding a selectable marker 
aoes not directly select the best c^cr-^i 

- *■ • — xz ^ j_liq 

cells. ^ 



The invention further provides a retroviral packaging cell 
line comprising a host cell transformed with (i, a packaging 
deficient construct which expresses a viral gag-pol gene and 
a first selectable marker gene, and/or (ii) « packaging- 
defxdent construct which expresses a viral env gene and a 
second selectable marker gene,- wherein a start codon of the 
first and second selectable markers are spaced from the stop 
codons of the viral gag-pol gene and the viral env gene 
respectively by a distance which ensures that reinitiation 
of mRNA translation is required for expression of marker 
protein product of said first and/or second selectable 
marker gene. 

The retroviral packaging cell line may be obtained by the 
above described process which will involve selecting 
transfected cells which express said first and/or second 
marker genes . 

By using helper constructs which are directly selectable and 
which provide for high expression of the viral gene, high 
titre retroviral vectors may be obtained. 

Helper constructs for use in the process form a furtner 
aspect of the invention. 

The retroviral vectors prepared from the conventional 
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packaging cell lines are usually not contaminated by " 
replication-competent retroviruses (RCRs) . However, 
recombinant amphotropic murine retroviruses have been shown 
to arise spontaneously from certain packaging cell lines. 
The generation of such RCRs involves recombination at least 
between gag-pol/env packaging sequence and vector sequences 
(Cosset et al., Virology, (1993) 193:385-395). 

Recombinant RCRs have been associated with the development 
of lymphomas in some severely immunosuppressed monkeys 
(Donahue et al . , J. Exp Med (1992) 176: 1125-1135). m 
addition, retroviral vector preparations may also contain, 
at low frequencies, retroviruses coding for functional 
envelope glycoproteins (Kozak and Kabat, 1990, J. Virol . 64: 
3500-3508) or for gag-pol proteins. Although the 
pathogenicity of these gag-pol or env recombinant 
retroviruses is probably low, more evolved recombinant 
retroviruses with higher pathogenic potential may occur when 
injected in vivo , by recombination and/or complementation of 
the initial recombinant viruses with some endogenous 
retroviruses . 

In a preferred embodiment of the retroviral packaging cell 
lines of the invention, the overlapping sequences between 
the genomes of the retroviral vector and the helper 
construct are reduced, for ■ example as compared to constructs 
such as CRIPenv and CRIPAMgag (Danos et al . , Proc . Natl. 
Acad. Sci USA 85: 6460-6464) . In particular, the viral 
sequences in the helper construct are reduced, for example, 
not only the packaging sequence but also the 3' Long 
Terminal Repeat (LTR) , the 3' non-coding sequence and/or the 
5 ' LTR may be eliminated. 

The possibility of generation of such RCRs and recombinant 
retroviruses can be reduced by reducing the overlapping 
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sequences between the genomes of both the retroviral vector 
and the helper construct. 

Conventional retroviral vectors are strongly inactivated by 
human serum which makes them of limited or no use for in 
SitU gene transfer in gene therapy applications. it has 
previously Lee., siwwn thai, inactivation by complement in 
human serum is controlled by the cell l ine use d to produce 
the vxrions and by viral envelope determinants (Takeuchi et 
al., J. Virol (1994) 68:8001-8007). In particular 
inactivation is caused by some properties of the cell lines 
that have been used to construct the packaging cells (NIH- 
3T3) and also by viral determinants located in the 
retroviral envelope as shown (Takeuchi et al j virol 
(1994) «:B00X-8007,. Invivo_ ge ne delivery is an important 
goal for a number of human gene therapy strategies. 

The applicants have found that certain cell lines form 
preferred packaging cell lines. 

Particularly preferred packaging cell lines are the HT1080 
line, the TE671 line, the 3T3 line, the 293 line and the Mv- 
1-Lu lme. One example of retroviral packaging cells that 
will produce complement -resistant virus comprise human 
HT1080 cells and express RD114 envelope. Such cells form a 
preferred aspect of the invention. 

Packaging cell lines according to the invention provide 50- 
100 fold increased titers of retroviral vectors as compared 
to conventional packaging cell lines. Retroviral vectors 
provided by these new cells are safe, in terms of generation 
of RCRs, and considerably more resistant to inactivation by 
human complement . 

Packaging cell lines according to the invention may be able 
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to transduce helper-free, human complement-resistant " 
retroviral vectors at titers consistently higher than 10' 



i -u. /ml . 



Suitable semi-packagxng cell lines i n accordance with the 
invention are those which express only the gag-pol genes 
Such cell lines may suitably be derived from TES71 , MINk'mv- 
1-Lu. HT1080, 293 or NIH-3T3 cells by introduction of 
plasmid CeB (the MoMLV gag-pol expression unit) . 

Particularly preferred expression vectors in accordance with 
the invention for use in retroviral packaging cell lines are 
those which include MLV gag and pol genes such as CeB 
Other plasmids m ay include gag and pol genes from other 
retroviruses or chimeric or mutated gag and pol genes . 

Various viral and retroviral envelope genes may be included 
xn the plasmids such as MLV-A envelope, GALV envelope VSV-G 
protein, BaEV envelope, RD114 envelope and chimeric or 
mutated envelopes. Plasmids which include the RD114 env 
gene such as FBdelPRDSAF as illustrated hereinafter, provide 
one example of suitable constructs. 

The novel retroviral packaging cells described hereinafter 
have been designated FLY cells, and may be designed for in 
vivo gene delivery. 

Considerable variations were found between the various cell 
Ixnes screened for their ability to release type C mammalian 
retroviruses, m addition, few cell lines were able to 
produce retroviruses completely resistant to human 
complement. Based on these two criteria, human fibrosarcoma 
HT1080 and rhabdomyosarcoma TE671 cells were selected for 
optimum construction of packaging cells. 
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Other studies have shown the importance of endogenous' 
retrovirus expression in the generation of recombinant 
retroviruses from retroviral packaging lines (Ronfort et 
al., Virology, (1995), 207, 271-275, Vanin, E F et al J 
Virol (1994) 68:4241-4250.). The co-packaging of an 
endogenous genome and a vector can lead to emergence of 
recombinant retrnvinic^c. ^ ~ 

• / oupid/ . 

Recombination involves template switching during reverse 
transcription of such hybrid retroviruses (Hu et al . , 
Science, (1990) 250:1227) and homologies between the ' two 
genomes considerably enhance the frequency of reverse 
transcriptase jumps (Zhang et al., J. Virol. (1994) 68- 
2409-2414). Therefore an ideal packaging cell line should 
not express endogenous MLV-like (or type C retrovirus-like) 
retroviral genomes which can be packaged by type C gag 
proteins (Scadden et al . , J. Virol. (1990) 64: 424-427, 
Torrent et al . , J. Mol . Biol. (1994) 240 434-444). 

Packaging of human endogenous retroviral RNA was not 
detected in TELCeB and FLY packaging cells when virion 
associated RNA was analysed by RT-PCR using generic primers. 

HT1080- and TE671 derived packaging cell lines may be safer 
in this respect than those generated from NIH3T3 cells, such 
as GP+EAM12 cells, which are known to express and package 
sequences related to type C retroviruses (Scadden et al . 
supra) . 

To generate the FLY packaging cell lines, HT1080 cells were 
transfected with gag-pol and env expression plasmids 
designed to optimise viral protein expression. Direct 
selection for viral gene expression was achieved in 
accordance with the invention by expression of a selectable 
marker gene by re - initiation of translation of the mRNA 
expressing the viral proteins. This strategy resulted in 
packaging cell lines capable of producing extremely high 
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titer viruses. Furthermore, long-term expression of ' 
packaging functions can be maintained in these cells Many 
unnecessary viral sequences were eliminated from the 
packaging constructs to reduce the risk of helper virus 
generation; indeed the final packaging cells did not produce 
helper virus, in that no replication competent virus "(RCR) 
could be detected per 10 7 vector particles. 

The FLY packagmg cells described herein are safer than for 
example, psiCRIP cells, at least for generation of env ' 
recombinant retroviruses as is illustrated in Table 4 
hereinafter, probably because less retroviral sequences 
overlapping with the vector were present in the present env- 
expression plasmid. Few reports have addressed the question 
of the characterization of recombinant retroviruses (RVs) 
(Cosset, F.L.. et al . , Virology (1993) 193:385-395). It is 
possible that such RVs could not be detected in previous 
packaging cell lines due to lower overall titers. RVs are 
defective in normal cell culture conditions but are likely 
to evolve to replication competent viruses if they are 
allowed to replicate in cells complementing their expression 
like co-cultivated packaging cells (Bestwick et al . , Proc . 
Natl Acad Sci USA, (1988) 85: 5404-5408, Cosset et al., 
(1993) supra) . 

In preferred retroviral packaging systems according to the 
invention, RVs are eradicated for example by removal of 
viral LTRs from the packaging construct . 

Consistent with our previous studies (Takeuchi, Y. , et al 
J Virol (1994) 68:8001-3007), LacZ(RD114) and LacZ (MLV- A) 
pseudotypes produced from HT1080 and TE671cells were more 
resistant to human complement than LacZ(RD114) or LacZ (MLV - 
A) pseudotypes produced by 3T3 of dog cells. it was 
therefore decided to use RD114 and MLV- A env genes to 
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25 
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11 

generate recombinant virions with MoMLV cores. 

The sequence of RD114 env gene was determined and is shown 
in Figure 4. It was found to be very close to BaEV .(baboon 
endogenous virus) a type C retrovirus (Benveixiste , R.E.et 
al., Proc. Natl. Acad. Sci . USA (1973) 70 : 3 3 16 - 3 3 2 0 ; ~Kato, 
£.et ul. f Japau. j. Geuet. liybV) 62:127-137) with an 
envelope gene displaying similarities to the external part 
of type D simian retroviruses (SRVs) . RD114 uses the SRV 
receptor on human cells (Somraerfelt & Weiss, Virology 
(1990) 176:58-69; Sommerf elt , M.A. et al . , J Virol (1990) 
64:6214-6220) making the FLY packaging cells with RD114 
envelope capable of generating virions with different 
tropism. Retroviral vectors prepared so far for human gene 
15 therapy have used either MLV-A or GALV (gibbon ape leukemia 

virus) envelopes which display some similarities (Battini, 
J.L.,et al., J Virol. (1992) 66:1468-1475) and which use two 
related cell surface receptors for infection (Miller, D.G. 
et al., J Virol (1994) 68:8270-8276). Differences in tissue- 
specific expression of MLV-A or GALV receptors have been 
reported (Kavanaugh et al . , Proc Natl Acad Sci USA (1994) 
91:7071-7075) . 



35 



The invention will now be particularly described by way of 
example with reference to the accompanying drawings in 
which : 

Figure 1 . illustrates the structure and expression of CeB. 
The env gene (Xbal-Clal) of plasmid pCRIP was removed and 
was replaced by coinsertion of the two fragments Xbal-Sfil 
(restriction sites underlined) from pOXEnv and a Sfil-Clal 
PCR product containing the bsr selectable marker. This" 
results in positioning the bsr start codon (shadowed) 74 bp 
downstream to the pol stop codon (bold) 
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Open triangle are stare codons (oag and bsr) , black " 
triangles are stop ccdons (qo! and bsr) . The shadowed 
triangle is the start codon of env, in che same reading 
frame with that of bsr. SD and SA are che spl±ce 
splice acceptor sites. 

Figure 2 illustrates the structure and expression of 
FbdelPASAF. 

immediately after the stop codon of env (bold) was inserted 
a non retroviral Kasl-Ncol (restriction sites underlined) 
linker which positions the phleo start codon (shadowed) 76 
bp downstream. 

Open triangle are start codons (env and phleo) , black 
triangles are stop codons (env and phleo ) . SD and SA are 
the splice donnor and splice acceptor sites. 

Figure 3 illustrates plasmids for expression of Ampho, Eco 
RD114. Xeno, 10A1, GALV, VSV-G and FeLVB envelopes. 

All genes are expressed in the same backbone as detailed in 
fig- 2. The Bglll sites for ecotropic (MoMLV strain) 
10A1, xenotropic (N2B.1.V6 strain) and amphotropic (4070A 
strain) , the Ndel site of RD114 (SC3C strain, the BamHl site 
for both FeLVB and GALV were used as 5' ends, and linked to 
Mscl site immediately after the splice donor site in the 
leader of FB2 9 LTR. 

Figure 4 shows the sequence of the RD114 env gene (SEQ ID No 

1 ) • 



Fxgure 5 shows the genetic structure of gag-pol constructs, 
initiation W and termination ( T) codons are shown. The 
thick dotted line below each construct shows MLV-derived 
sequences. Nucleotide positions of MLV-derived sequences are 
shown according to: Shinnick et al . (i 98 l) ( from nt 1 to nz 
6000 with deletion of the packaging signal ( DY) from Ball 
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(nt 215) to Psci (nt 568), and with some further MoMLV 
sequences in both CeB and Ce3 DS- from nt 7676 to nt 793 8 
gag-pol and bsr genes were expressed f rora the same 

(Mo LTR) or a non retroviral promoter (hCMV) and non 

?,TlZ a Ll^ ade T^ ti0n SeqU6nCe (P °^ A > ■ ^nor 
"" aie indicated. The thin line 
denotes retroviral non coding sequences. The thick line 
shows the rabbit beta-1 globin intron B. The position of 
some restriction sites is indicated. 

The nucleic acid sequences of portions of constructs (as 
shown in Figuxe S < boxed areas) , are digplayed ^ ^ 

ID No 2, Figuxe 6), hCMV + intron (SEQ ID No 3, Figure 7) and 
hCMV + intronkaSD (SEQ ID No 4, Figure 8). 



The nucleic acid sequences of portions of constructs (as 
shown in Figure 3 (boxed areas)) are displayed for 
FbdelPASAF (SEQ ID No 5, Figure 9), Fbde 1 PMOSAF (SEQ ID No 
6, Figure 10) , Fbde 1 PGASAF (SEQ ID No 7, Figure 11) 
FbdelPRDSAF. (SEQ ID No8, Figure 12) and CMV10A1 (SEQ ID No 
9, Figure 13) are shown. 

The components of the viral particles are produced by two 
independent expression plasmids (ga^-pol or env) which also 
contain selectable markers (bsr or ^hle^) expressed from the 
same transcriptional units as gag-pol or env (fi gs 1& 2) 
The selectable markers are located downstream to sag-pol or 
env genes and there is an optimal distance between the stop 
codon of the upstream reading frames and the start codon of 
the selectable genes that should allow re-initiation of 
translation (Kozak, Mol Cell Biol. (1987) 7, : 3438-3445) 
Because there is no "Kozak" sequence (Kozak, Cell,(i 98 6) 44- 
283-292) required for a normal initiation of translation for 
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the marker gene, they can only be expressed by re-initiatxon 
of translation after the upstream viral gene has been 
successfully expressed. Consequently and also because re- 
initiation of translation is a poorly efficient process, 
after transfection of these plasmids, cells resistant to th= 
drugs corresponding to those selectable genes express" high " 
levels of the viral proteins. 

To avoid viral transmission of these "helper" genomes the 
constructs used suitably have the classical deletions of 
both the packaging sequence located in the leader region and 
of the 3'LTR, the latter being replaced by SV40 
polyadenylation sequences (Figs 1 & 2) . 

Plasmid CeB is the MoMLV gag-pol -expression .unit . It 
derives from pCRIP, a plasmid used to generate the 
constructs introduced in the CRIP and CRE packaging cell 
lines (Danes and Mulligan, 1988). As shown in fig. i for 
generation of plasmid CeB the env gene of pCRIP has been 
deleted mostly and the bsr selectable marker, -encoding a 
protein conferring resistance to blasticidin (Izumi et al . , 
Experimental Cell Research (1991) 197, 229-233)- has been 
inserted downstream to pol gene. There are exactly 74 bp 
with no ATG triplets between the stop codon of pol and the 
start codon of bsr, this allows its expression by re- 
initiation of translation on the gag-pol mRNA, after 
translation of the g^a-pol reading frame. 

FbdelPASAF is a plasmid expressing the amphotropic env gene 
and the phleo selectable marker conferring resistance to 
phleomycin (Gatignol et al . , FEBS Letters (1988) 230:171- 
175) . By using a PCR-mediated mutagenesis strategy which 
modifies the end of env gene (see fig. 2), a 76 bp linker 
was inserted between the stop codon of env and the start 
codon of phleo. This allows expression of phleo from the 
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env mRNA by re- inatiation of translation. m addition 
compared to known env-expressing constructs, this strategy 
of construction has reduced the length of sequences 
overlapping with the ends of conventional retroviral 
vectors. The env genes of Mo-MLV, FeLVB , NZB.1V6 10A1 

t^^t™ 114 SX P resSed b >' Plasmids FBdelPMoSAF, ' 

FSdciPBSAF, .'-.iXoAr, radeipcssAF, FBdelplOAlSALF and 
FBdelPRDSAF, respectively, by using the same backbone as 
FBdelPASAF (fig. 3) . Retroviral vectors produced with the 
RD114 envelope will be useful for in vivn gene delivery as 
comparatively to MLV ecotropic or amphotropic envelopes 
" lri ° nS P se ^otyped with RDH4 envelopes are not inactivated 
by human complement when they are produced by Mink Mv-i-Lu 
cells or by some human cells (Table 1) . 

The HT1080 cell line, isolated from a human fibrosarcoma 
(ATCC CCL121) . The TE671 cell line isolated from a human 
rhabdomyosarcoma (ATCC CRL 8 8 05) (purchased from ATCC and 
tested for absence of usual cell culture contaminants by 
ECACC) , has been used for the definitive construction of 
packaging cell lines. HTX080 line was chosen among a panel 
of primate and human lines because MLV-A and RD114 
efficiently rescued retroviral vectors from these cells and 
also because RD114 pseudotypes produced by this cell line 
were stable when incubated in human serum. in a standard 
assay (Takeuchi et al . , J Virol (1994), 68, 8001-8007), 
these latter viruses were found more than 500 fold more 
stable than similar pseudotypes produced in 3T3 cells. 

Another advantage for the use of non murine cells to derive 
packaging lines 1S the absence of MLV-related endogenous 
retroviral-like sequences (like VL30 in 3T3 cells) that can 
cross-package with MLV-derived retroviral vectors (Torrent 
et al., 1994) and generate potentially harmful recombinant 
retroviruses . 
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The helper constructs were introduced into other cell lines 
(HT1080 (table 2) Mink Mv-l-Lu (table 2), 3T3 (not shown), 
TE671 (table 2)) for the purpose of comparisons of the 
efficiency of the constructs. 

As illustrated hereinafter (Table 2), the reverse 
transcriptase (RT) activity (provided by expression of the 
pol gene) in cells transfected with CeB is significantly 
higher than that of the same cells transfected by the 
parental plasmid pCRIP or that of cells chronically infected 
by MLV. This enhancement of viral gene expression is 
correlated with the titers of lac2 retroviral vectors when 
an envelope is provided in CeB-lacZ cells after comparison 
with titers of lacZ pseudotypes of either replication- 
competent viruses or other helper- free packaging systems. 

For the generation of final packaging cell lines, the best 
clonal env transf ectants have been selected. Packaging 
systems obtained in this way will be able to produce helper- 
free retroviral vectors at titers greater than lo 8 
infectious particles per ml, which would be 10-100 fold 
higher to helper-free preparations of others. 

Because of the way the selectable markers are expressed (see 
above) , growing the packaging cells in phleomycin and 
blasticidin selective pressure increase and stabilize the 
expression of the retroviral components and particularly the 
envelopes, as it is possible that env glycoproteins have 
toxic effects for the producer cells in the long term which 
may lead to a decrease of expression. 



Such an enhancement of viral production observed with the 
packaging systems described herein might increase the 
emergence of unwanted retroviruses having recombined between 
35 the genomes of both the retroviral vector and either of the 
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two packaging-deficient constructs. However, the constructs 
have been designed in such a way that it reduces the 
probability of emergence of recombinant viruses compared to 
the parental constructs. To check their safety, attempts 
5 have been made to detect the presence of replication- 

competent retroviruses by a mobilisation assay of a lacZ 
^ :::v: ^' Wo RC "viruses nave been found in all retroviral 
vector preparations tested so far. 

10 The following Examples illustrate the invention. 

Example 1 

Preparation of Cell lines and viruses. 
15 The following cell lines were used: 

A204 (ATCC HTB 82), HeLa (ATCC CCL2 ) , HT1080 (ATCC CCL121), 
MRC5 (ATCC CCL171), T24 (ATCC HTB 4), VERO (ATCC CCL81) and 
D17 (ATCC CCL183) were purchased from ATCC. 



20 HOS, TE671 and Mv-l-Lu cells and their clones harboring 

MFGnlslacZ retroviral vector as described by Takeuchi et 
al . , J Virol (1994), 68, 8001-8007. 

The above cell lines were grown in DMEM (Gibco-BRL, U.K.) 
25 supplemented with 10% fetal calf serum. 

EB8 (Battini et al . , J. Virol (1992) 66: 1468-1475); 
psiCRE, psiCRELLZ and psiCRIP (Danos et al . , Proc. Natl. 
Acad. Sci USA (1988) 85: 6460-6464); 
30 Cells GP+EAM12 (Markowitz et al . , Virology (1988), 167, 400 

406) ; and 

NIK-3T3 murine fibroblasts. 

These cell lines were grown in DMEM (GIBCO-BRL,, U.K.) 
35 supplemented with 10% new-born calf serum. 
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Mv-l-Lu, TE671 and HT1080 cells were transfected using 
calcium-phosphate precipitation method (Sambrook et 
"Molecular Cloning" 1989, Cold Spring Harbour Laboratory 
Press: New York) as described elsewhere (Battini et al . , 
supra) . CeB-transfected Mv-l-Lu, TE671 and HT1080 cell's 
were selected with 3, 6-8 and 4 M g/ml of blasticidin S (ICN 
UK) , respectively, and blasticidin-resistsuit colonies were ' 
isolated 2-3 weeks later. Cells transfected with the various 
env-expression plasmids were selected with phleomycin 
( CAYLA, France) : 50 M g/ml (for FBASALF- transfected cells) or 
10 /xg/ml (for FBASAF-, FbdelPASAF- , FbdelPMOSAF 
FBdelPIOAISAF or FBdelPRDSAF- transfected cells)! Phleomycin- 
resistant colonies were isolated 2-3 weeks later. 

Production of lacZ pseudotypes using replication competent 
viruses, amphotropic murine leukemia virus (MLV-A) 1504 
strain and cat endogenous virus RD114, was carried out as 
described previously (Takeuchi et al . , j virol (1994), 68, 
8001-8007) . 

Example 2 

Preparation of Plasmids. 

The env gene of pCRIP (Danes et al . , supra) was excised by 
Hpal/Clal digestion. A 500 bp PCR-generated DNA fragment was 
obtained using P SV2-bsr (Izumi et al . , Experimental Cell 
Research (1991), i 97< 299-233) as template and a pair of 
oligonucleotides: 

(5 ' >CGGAATTCGGATCCGAGCTCGGCCCAGCCGGCCACCATGAAAACATTTAACATTTC 
TO (SEQ ID NO 2) at 5' end and 

(5' >GATCCATCGATAAGCTTGGTGGTAAAACTTTT) (SEQ ID No 3 ) at: 3 ' 
end, with Sfil and Clal sites, respectively. This fragment 
was inserted in Hpal/clal sites of pCRIP by co-ligation with 
a 85 bp Hpal/Sfil DNA fragment isolated from pOXEnv (Russell 
et al., Nucleic Acids Research (1993), 21, 1081-1085) which 
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provides the end of the Moloney murine leukemia virus 
(MoMLV) pel gene. The resulting plasmid named CeB (Fig i) 
could express the MoMLV gag-pol gene as well as the bsr 
selectable marker conferring resistance to blasticidin S 
both driven by the MoMLV 5 ' LTR promoter. 

a sen« of mv sxprus^ioxi pia«mias was generated using the 
4070A MLV (amphotropic) env gene (Ott et al. f j virol 

(1990) , 64, 757-766) and the FB29 Friend MLv'promoter 
(Perryman et al . , Nucleic Acid Res (1991), 19 6950) In 
FBASALF (Fig. 1, a Bglll/Clal fragment containing the env 
gene was cloned in BamHI/Clal sites of plasmid FB3LPh which 
also contained the C57 Friend MLV LTR driving the expression 
or the phleo selection marker. A 136 bp env fragment was 
generated by PGR using plasmid FB3 (Heard et al . , j virol 

(1991) , 65, 4026-4032) as template and a pair of 
oligonucleotides: (5 ' >GCTCTTCGGACCCTGCATTC) (SEQIDN0 4) at 
5' end (before Clal site) and 

(5 ' >TAGCATGGCGCCCTATGGCTCGTACTCTATAGGC) (SEQ ID NO 5) at 3' 
end, providing a KasI restriction site immediately after the 
env stop codon. This PGR fragment was digested using Clal 
and Kasl. ADNA fragment containing the FB29 LTR and the 
MLV-A env gene was obtained by Ndel/Clal digestion of 
FBASALF. The fragments were co-ligated in Ndel/KasI digested 
PUT626 (kindly provided by Daniel Drocourt, CAYLA labs, 
France) . in the resulting plasmid, named FBASAF (Fig. 1) 
the phleo selectable marker was expressed from the same mRNA 
as the env gene. A Bglll restriction site was created after 
the Msci site at position 214 in the FB29 leader by using a 
commercial linker (Biolabs, France). A Ndel/Bgln fragment 
containing the FB29 LTR was co-inserted with the Bgln/cial 
env fragment in Ndel /Clal -digested FBASAF plasmid DNA, 
resulting in plasmid FBdelPASAF (Fig. 1). Compared to' 
FBASAF, FBdelPASAF has a lOObp larger deletion in the leader 
region. 
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Example 3 



Cloning and Sequencing of the RD114 env gene 
The RD114 env gene was first sub-cloned in plasmid 
Bluescript KS + (Stratagene) as a 3 Kb Hindlll insert 
isolated from SC3C, an RD114 infectious DNA clone (Reeves et 
al., J . Virol (1984), 52, 164-171). A 2.7 kb Seal-Hind III 
fragment of this subclone containing the RD 114 env gene was 
sequenced (Figure 4 (SEQ ID NO 1,- EMBL accession number- 
X87829). The 5' non-coding sequence upstream of an Ndel site 
was deleted by an EcoRI/Ndel digestion followed by filling 
xn with Klenow enzyme and self -ligation. From this plasmid 
two DNA fragments were obtained: a BamHI/Ncol 2 . 5 Kb 
fragment and a 63 bp PCR-generated DNA fragment using 
(5 ' >CGCCTCATGGCCTTCATTAA) (SEQ OD NO 6) at 5' end (before 
NotI site) and (5 ' >TAGCATGGCGCCTCAATCCTGAGCTTCTTCC) (SEQ ID 
NO 7) at 3' end, providing a KasI restriction site just 
after RD114 env gene stop codon . The PGR fragment was 
digested with Ncol and KasI. Both fragments were co- 
inserted between Bglll and KasI sites of FBdelPASAF and the 
resulting plasmid was named FBdelPRDSAF (Fig. i) . 

Plasmid pCRIPAMgag- (Danes, O. et al . , ' Proc Natl Acad 
Sex USA (1988) 85:6460-6464) was used for transf ect ion . 

Example 4 
Infection assays. 

Target cells were seeded in 24-multiwell plates (4x10' cells 
per well) and were incubated overnight. Infections were then 
earned out at 37°C by plating l ml dilutions of viral 
supernatants in the presence of 4 M g/ml polybrene (Sigma) on 
target cells. 3h later vi rus -containing medium was replaced 
by rresh medium and infected cells were incubated for tv 
days before X-gal staining, performed as previously 
described (Tailor et al., J Virol (1993), S7 , 6737-6741 



:wo 
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Takeuchi et al . , J Virol (1994), 68 , 8001-8007) Viral 
titers were determined by counting lacZ-positive colonies as 
previously described (Cosset et al., j. virol. (1990) 64 
1070-1078). Stability of lacZ pseudotypes in fresh human' 
serum was examined by titrating surviving vi rus after 
incubation in 1:1 mixture of virus harvest in serum-free 
medium and fresh hnm^ — ^ . 

~ " ------- * - " w <~ *s aescribed 

before (Takeuchi et al . supra). 



Example 5 

Reverse transcriptase (RT) assay. 

RT assays were performed either as described previously 
(Takeuchi et al . supra) or using an RT assay kit (Boehringer 
Mannheim, U.K.) following the manufacturer's instruction but 
using MnCl 2 (2 mM) instead of MgCl 2 . 

Example 6 

Screening producer cell lines. 

Viral particles generated with RD114 envelopes have been 
found to be more stable in human serum than virions with 
MLV-A envelopes and that the producer cell line also 
controls sensitivity (Takeuchi et al . supra). A panel of 
cell lines was screened for their ability to produce high 
titer viruses and for the sensitivity of these virions to 
human serum. To do this, cells were infected at high 
multiplicity with lacZ pseudotypes of either MLV-A or RD114 
and cells producing helper-positive lacZ pseudotypes were 
established. Human HT1080 and TES71 and mink Mv-l-Lu cells 
were found to release high titer lacZ(RD114) and lacZ (MLV-A) 
viruses. LacZ (MLV-A) pseudotypes produced by HT1080 cells 
were more resistant to human serum than those produced by 
other cells. The titer of these viruses was only four-fold 
less following a l hr incubation with human serum than a 
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control incubation (Table l). LacZ(RD114) pseudotypes" 
produced by human cells or mink Mv-l-Lu cells were in 
general stable in human serum (Table l) . These results 
suggested that HT1080, TE671 and Mv-l-Lu cells provided the 
best combination of high lacZ titers and resistance to human 
serum and they were therefore used for the generation "of 
retroviral packaging cells. 

Table 1. Titer and stability of lacZ pseudotypes. 



Producer LacZ (MLV-A) 

cel1 Titer* Stability* Titer* 



Stability* 



A204 


650 


<3 


1,200 


105 


HeLa 


9 


nd 


2, 000 


115 


HOS 


4 , 500 


6 


23 , 000 


86 


HT1080 


2 , 000 , 000 


26 


400, 000 


129 


MRC-5 


450 


10 


1,000 


nd 


T24 


350 


nd 


1,200 


nd 


TE671 


15, 000 


2 


90, 000 


38 


VERO 


260 


nd 


90 


nd 


D17 


900 


<1 


200, 000 


1 


Mv-l-Lu 


80, 000 


1 


200,000 


120 



71 cells as lacZ i.u./ml 
b: % of infectivity of human serum- treated viruses compared to fetal calf serum- 



treated viruses 



Example 7 



Construction of an improved gag-pol expression vector. 
A MoMLV gag-pol expression plasmid, CeB (Fig. i), 



was 
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derxved from pCRIP (Danes et al„ Proc. Natl. Acad Aci USA 
(1988) 85: 6460-6464). Approximately 2 Kb of env sequence 
were removed from pCRIP and the bsr selectable marker 
conferring resistance to blasticidin s (i 2U mi et al 
Experimental Cell Research (1991, 197:229-233), was'inserted 
74 nts downstream of the gag-pol gene. This 74 nts interval 

had no ATG trinlets a^H t^c- ^ . ._ 

- - — ^--v^uc an optimal 

distance between the stop codon of the P ol reading frame and 
the start codon of the bsr gene to allow re-initiation of 
translation (Kozak Mol Cell Biol., 1987, 7: 3438-3445) 
There was no -Kozak" consensus sequence (Kozak Cell ( 1986) 
44: 283-292) at the 5' end of the marker gene. Therefore 
bsr could only be expressed by re-initiatiori of translation 
after the upstream gag-pol gene had been expressed. 
Consequently, after transfection of CeB in Mv-l- 
Lu/MFGnlsLacZ (ML) , TE671/MFGnlsLacZ (TEL) or HT1080 cells 
blasticidin S-resistant bulk populations and most cell 
clones expressed high levels of gag-pol proteins assessed by 
the reverse-transcriptase (RT) activity found in cell 
supernatants (Table 2). Considerably higher RT activities 
were found in bulk populations of CeB- trans fee ted ML cells 
compared to. bulk population of ML cells stably transfected 
with the parental pCRIP construct. Similarly the RT 
activities of two packaging cell lines generated using 
pCRIPenv- construct, psiCRE cells (Danes et al., supra) and 
EB8 cells (Battini supra.) were less than that of CeB 
transfected clones (Table 2). Finally, RT activitiy in CeB 
transfected cell supernatants was higher than that of cells 

chronically infected by replication-competent MLV-A (Table 
2) . 



Table 2. Secreted reverse transcriptase express 



xon 



Cell" 



RT activity* LacZ Titer c 
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ML/MLV-A 


1 




8xlC 4 


MLSvB 


0 


.1 


< i 


MLCRI? (bulk) 


0 


. 15 


rici 


MLCeB (bulk) 


1 


. 7 




MLCeBl 


4 


.2 


JLX.L U 


MLCeB4 


1 


. 6 


1x1 0 6 


TEL/MLV-A 


3 


6 


^Al u 


TELCeB6 


5 


2 


*± X i. U 


HT1080/MLV-A 


1 . 


1 


IxlO 6 


HTCeBS 


1 . 


9 


1x10 s 


HTCeB18 


2 . 


7 


2x10* 


HTCeB22 (FLY) 


6 . 


9 


5xl0 6 


HTCeB4 8 


5 . 


5 


3x10 s 


EBB 


0 . 


22 


1x10* 


psiCRE-LLZ 


1 . 


2 


lxlO sd 



r M FGnl^'; LU " 11S h " b ° rin3 8 ^°^1"2 proviru..- TEL, cells harboring 

a MFGnlslacZ proviru.; /MLV-A, cells chroni^lly infected- with MLV-A 1S04 
straxn; MLSvB, ML cells transfected wich a plasmid pSVSbsr alone; MLCRIP ML 
cells co-transfected with pCRip and pSV2bsr. 

b: Average of arbitrary units relative to ML/MLV-A RT activity of at least two 
independent experiments was shown. The standard errors did not exceed 20 % of 
the values. 

olas^rTV 11 - TES?1 " 1302 i U - /ml ' transection of a 

!* EB8 IT ^" SSeS ^ MLCSB Cl — ' TELCe * do«.. HTCeB clone, 

ana EB8 cells; nd, not done. 

d: titration on NIH3T3 cells 

To rescue infectious lacZ viruses, MLCeB and TELCeB clones 
were transfected with FBASALF DNA, a plasmid designed to 
express the MLV-A env gene (Fig. i) . Bulk populations of 
stable FBASALF transf ectants were isolated and supernatants 
were titrated using TE671 cells as targets. Titers of lacZ 
viruses were higher than either MLV-A infected ML or TEL 
cells, or FBASALF-transfected EB8 cells (Table 2) . These 
data suggested that CeB was an extremely efficient MLV gag- 
pol expression vector m mink Mv-l-Lu and TE67l" cells. CeB 
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10 



15 



was therefore used to derive packaging cells by transection 
or HT1080 cells. 41/49 blasticidin S-resistant colonies had 
detectable levels of RT; 9 had RT activity higher than that 
of control MLV-A- infected HT1080 cells (data not shown) . 
Expression of gag precursor was confirmed in cell lysates 
and supernatants of these 9 HTCeB clones by immunoblotting 
using antibodies against p3 0-CA (data r^<- , 
clones with the highest expression of gag proteins (clones 
6,18,22 and 48) were infected at high-multiplicity with 
helper free, lacZ pseudotypes bearing MLV-A envelopes 
(MFGnlslacZ (A) ) produced by TELCeB6/FBASALF (Table 3) and 
then transfected with FBASALF. Supernatants of bulk, 
phleomycin-resistant transf ectants were assessed for RT 
activity and lacZ titer (Table 2). Clone HTCeB 2 2 , named FLY, 
was found to be the best gag-pol producer clone and was used 
to introduce env expression vectors for the generation of 
packaging cell lines. 
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Table 3. Titer following env construct: transfect ion 



Producer cell 


Env source 


Titer a 


psiCRIP lacZ 5 


pCRIPAMgag- 


6xl0 4i3 


GP+EAM12 lacZ 25 


envAM 


3xlO Sb 


TELCeB6 


FBASALF C 

FBASAF C 

FbdelPASAF 0 


5x1 0 7 
2xl0 7 
2xl0 7 


TELCeBG 


FBdelPASAF 1 
FbdelPASAF 4 
FbdelPASAF 6 
FbdelPASAF 7 
FbdelPASAF 3 
FbdelPRDSAF 2 
FbdelPRDSAF c 
FbdelPRDSAF 7 
FbdelPRDSAF 8 


3xl0 7 
2xl0 7 
lxlO 7 
5xl0 7 
lxlO 7 
lxlO 6 

OXl U 

lxio 7 


FLY d 


FBdelPASAF 1 
FbdelPASAF 4 
FbdelPASAF 5 
FbdelPASAF 7 
FbdelPASAF 13 
FbdelPASAF 14 
FbdelPASAF 15 
FbdelPASAF 16 
FbdelPASAF 17 


lxlO 1 

1 . 5xl0 6 

1x10 6 

lxlO 6 

7xl0 6 

4xl0 6 

lxlO 6 

5xl0 6 

6xl0 6 


FLYA4 lacZ 3 


FBdelPASAF 4 


2xl0 7b 


FLY d 


FBdelPRDSAF 1 
FbdelPRDSAF 2 
FbdelPRDSAF 6 
FbdelPRDSAF 10 
FbdelPRDSAF 11 
FbdelPRDSAF 13 
FbdelPRDSAF 17 
FbdelPRDSAF 18 
FbdelPRDSAF 19 


2 . 5x10 s 

lxlO 7 

SxlO 6 

2xl0 6 

3x10 s 

lxlO 6 

5xl0 6 

3xl0 7 

6xl0 6 



Average titers of a- least three independent experiments were shov 

standard errors did not exceed 30 V of the titer values. 

a: titrated on TES71 cells as lacZ 1 . u . /ml 

b: results of best MFGnlslacZ producer clones. 
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c: bulk populations of env- t ransf ecc ant s in TELCeBS cells. 

d: titration after bulk infection wich helper- free MFGnlslacZ. 



Example 8 

Construct: ion of env expression vectors. 
~ o^xxes u£ ML.V-A env expression plasmids were then 
generated (Fig. 1). In FBASALF, the env gene was inserted 
between two Friend-MLV LTRs , its expression driven by the 
FB29 MLV LTR (Ferryman et al . , supra) . Most of the packaging 
signal located in the leader region was deleted. This 
plasmid also expressed the phleo selectable marker (Gatignol 
et al., supra) driven by the 3' LTR. FBASAF and FBdelPASAF 
were then designed following the same strategy used for CeB . 
These two vectors differed only by the extent of deletion of 
the packaging signal, FBdelPASAF having virtually no leader 
sequence. Compared to pCRIPAMgag- and pCRIPgag-2 env 
plasmids expressed in psiCRIP or psiCRE packaging cells 
(Danos et al . , supra) about 5 Kb of gag-pol sequences was 
removed. In addition the 258 bp retroviral sequence 
containing the end of env gene and the begining of U3 found 
in pCRIPAMgag- and pCRIPgag-2 was also removed. For both 
FBASAF and FBdelPASAF plasmids, the phleo selectable marker 
was inserted downstream of the env gene by positioning a 76 
nts linker with no ATG codons between the two open-reading 
frames. Phleo could therefore only be expressed by re- 
initiation of translation by the same ribosomal unit that 
had expressed the upstream env open reading frame. 
FBdelPASAF was also used to generate FBdelPRDSAF , an RD114 
envelope expression plasmid (Fig. l) . 

After transfection of the env plasmids into TELCeB6 cells 
(Table 2), bulk populations of phleomycin - resistant colonies 
were isolated and their production of lacZ virus measured 
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(Table 3) . FBASALF gave a titer of SxlO 7 lacZ- i . u . /ml / whilst 
titers with either FBASAF or FBdelPASAF were 2xl0 7 lacZ- 
i.u./ml (Table 3). Titers of 5xl0 7 or 10 7 lacZ- i . u . /ml could 
be obtained with some FBdelPASAF cell clones or FBdelPRDSAF 
clones, respectively. 

As FBdelPASAF has minimal virus-derived sequences and was 
shown to be the safest construct (see below and Table 4) 
it and FBdelPRDSAF were used to generate packaging lines' 
from FLY cells (clone HTCeB22 , Table 2). Envelope express!., 
of these clones was assayed by interference to challenge 
with MFGnlslacZ(A) or MFGnlslacZ (RD) pseudotypes produced by 
TELCeB6 /FBde 1 PASAF - 7 or TELCeB 6 /FBdelPRDSAF- 7 , respectively 
(Table 3) . The cell lines showing most interference were 
cross-infected at high multiplicity with these pseudotypes 
to provide MFGnlslacZ proviruses, and supernatants were then 
titrated on TES71 cells (Table 3) . FLY -FBdelPASAF- 13 (FLYA13 
packaging line) and FLY -FBdelPRDSAF- 18 (FLYRD18 packaging 
line) gave the highest productions of lacZ viruses, around 
10 7 lacZ-i.u./ml. The best MFGnlslacZ producer clones derived 
from either psiCRIP cells JDar.os e: al . , supra) or GP+EAM12 
cells (Markowitz et al . , supra) gave approximately 50 fold 
lower titers (Table 3). The lacZ titers of the FLY-derived 
lines shown in Table 3 are lower than the best TELCeBS- 
derived lines after trans feet ion of either FBdelPASAF or 
FBdelPRDSAF (Table 3). However it should be noted that the 
lacZ provirus expressed in TELCeB 6 cells was obtained after 
clonal selection but was introduced polyclonally in FLY- 
derived env-transfected cell clones. When FLY -FBdelPASAF -4 
cells (FLYA4 packaging line), infected with helper-free 
MFGnlslacZ (RD) , were cloned by limiting dilution the best 
clones (eg. FLYA4lacZ3) were found to produce 20 times more 
infectious viruses than the bulk population, reaching the 
range of titers obtained with the best TELCeB 6 - FBdelPASAF 
clones (Table 3 ) . 
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Example 9 

Assays for transfer of gag-pol or env functions. 
To assay for replication-competent viruses, supernatants 
were used to infect TEL cells (a clone of TE671 cells 
harboring an MFGnlslacZ provirus) . Infected cells were 
passaged for 6 days or longer and their suner^tn-, 
used for infection of fresh TE671 cells. No transmission^ 
lacz viruses could be detected (Table 4), demonstrating that 
the supernatants of pCRIPAMgag- - , FBASALF- , FBASAF- , or 
FBdelPASAF-transfected TELCeBS cells were helper-free. 
Similar absence of replication competent recombinant 
retroviruses was demonstrated using supernatant from a 
clone of psiCRIP-MFGnlslacZ cells or from two clones of 
FLYA- MFGnlslacZ cells (Table 4) . 

There have been reports that helper- free retroviral vector 
stocks may nevertheless contain recombinant retroviruses 
(replication incompetent) carrying either gag-pol or env 
genes (Bestwick et al . , Proc Natl Acad Sci USA (1988), 85, 
5404-5408, Cosset et al . , Virology (1993), 193, 385-395, 
Girod et al : , Virology (1995), in press). To assay for such 
recombinant retroviruses, mobilisation of an MFGnlslacZ 
provirus from two indicator cell lines which could cross- 
complement potential recombinant viruses carrying either 
gag-pol or env functional genes was attempted. The TELCeBS 
line (Table 2) expressing gag-pol proteins was used as 
indicator cell line to test for the presence of env 
recombinant ( ER) viruses. The TELMOSAF indicator line 
expressing MoMLV env glycoproteins (obtained by transfection 
of FBMOSAF, a plasmid expressing the MoMLV env gene using 
FBASAF backbone, in TEL cells) was used to detect the 
presence of gag-pol recombinant retroviruses (GPR viruses) 
After passaging 4-8 days, the supernatants of the infected 
indicator cells were used to infect either human TE671 cells 
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or murine NIH3T3 cells. 



TELCeBS cells transfected with various env-expressing 
constructs, pCRIPAMgag-, FBASAF and FBdelPASAF were 
compared. Although the supernatants of TELCeB 6 -FBdelPASAF 
cells were devoid of replicat ion- competent retroviruses, 
they were found sporadically to transfer gag-pol genomes 
(Table 4) . No GPR viruses could be detected when less than 
2x10 s virions were used to infect the indicator cells. 
Similarly TELCeBG indicator cells infected with various 
helper-free viruses were shown sporadically to release lacZ 
virions (Table 4) . The number depended both on the env- 
expression vector used and on the virus input quantity. 
Compared to lacZ viruses generated using pCRIPAMgag- 
plasmid, the frequency of detection of the env- recombinant 
viruses was lower for supernatants generated by using FBASAF 
and FBdelPASAF constructs (Table 4). For FBdelPASAF 
construct when less than 5x10 s MFGnlslacZ (A) helper-free 
virions were used to infect the indicator cells, no ER 
retroviruses could be detected. From these experiments, it 
could be estimated that a supernatant, produced from 
TELCeB 6 -FBdelPASAF cells, containing lxlO 7 infectious units 
of MFGnlslacZ retroviral vector contained no replication- 
competent virus, and about 100 gag-pol and 100 env 
recombinant retroviruses. 
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Table 4. Transfer of packaging function 



Producer cell 



TELCeB6-pCRIPAMgag- 
TELCeB6-pCRIPAMgag- 
TELCeB6-pCKD?AMgag- 

TELCeB6-FBASAF 
TELCeB6-FBASAF 
TELCeB6-FBASAF 

TELCeB6-FBdelPASAF 
TELCeB6-FBdelPASAF 
TELCeB6-FBdelPASAF 



Indicator cell input virus' Detecting" 
(lacZ-i.u.) ++ + 



psiCRIP lacZ 5 


TEL 


Replication compprpnr vj ni <; 
2xl0 4 0/4 


074 


4/4 


T"PT (~**.T>C riTi-. » » 


TEL 


5x1 0 6 


0/4 


0/4 


4/4 


TELCeB6-FBASAP 


TEL 


5x10* 


0/4 


0/4 


4/4 


TELCeB6-FBdelPASAF 


TEL 


5x10* 


0/4 


0/4 


4/4 


FLYA4 lacZ 3 
FLYA4 lacZ 7 


TEL 
TEL 


lxlO 7 
lxlO 7 


0/4 
0/4 


0/4 
0/4 


4/4 
4/4 



TELCeB6-FBdelPASAF 7 
TELCeB6-FBdelPASAF 7 
TELCeB6- FBdelPAS AF 7 
TELCeB6-FBdelPASAF 7 



Gag-pol r ecombinant 



TELMOSAF 


2xl0 7 


0/4 


1/4 


3/4 


TELMOSAF 


2xl0 6 


0/4 


2/4 


2/4 


TELMOSAF 


2x10 s 


0/4 


2/4 


2/4 


TELMOSAF 


2x10* 


0/4 


0/4 


4/4 


Env recombinent 


TELCeB6 


5X10 6 


2/4 


1/4 


1/4 


TELCeB6 


5x10 s 


1/4 


1/4 


2/4 


TELCeB6 


SxlO 4 


0/4 


2/4 


2/4 


TELCeB6 


5x1 0 6 


0/4 


2/4 


2/4 


TELCeB6 


SxlO 5 


0/4 


1/4 


3/4 


TELCeB6 


SxlO 4 


0/4 


1/4 


3/4 


TELCeB6 


5xl0 6 


0/4 


1/4 


3/4 


TELCeB6 


5x10 s 


1/4 


3/4 


0/4 


TELCeB6 


5xl0 4 


0/4 


0/4 


4/4 



a: number of lacZ i. u . used to infect indicator cells 

b: number of incidence out of four experiments. The ranges of lacZ titers 
rescued from infected indicator cells are shown for each virus input: >100 
lacZ i.u. /ml <+♦), i-ioo lacZ i . u . /ml (+) and <l lacZ i.u./ml (-). 

Titers were determined on TE671 cells for replication 
competent virus and env recombinant and NIH3T3 cells for 
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gag-pol recombinant: . 



Example 10 

In order to confirm resistance to complement and absence of 
replication competent virus in our best packaging l ines 
MFGnlslacZ (A) and (RD) harvested from FLYAl 3 and FLYRD1S 
respectively, after polyclonal transduction of MFGnlslacZ 
(Table 3 above) were tested for stability in fresh human 
serum and generation of replication competent virus Tit=rs 
of MFGnlslacZ (RD) from FLYRD18 after l hr incubation with 3 
independent samples of fresh human serum were 80 to 120 % 
of control incubations, while titers of MFGnlslacZ (A) f>- om 
FLYA13 were 50 to 90 % of controls (data not shown) . No 
replication competent virus was detected in the same assay 
described above (Table 4) when 1 x 10 7 i. u . each of 
MFGnlslacZ (A) and (RD) were tested. 



EXAMPLE 1 1 



Generation of plasmids . 

CeB plasmid (Fig. 5 ) expressing MoMLV gag-pol gene, was 
further modified to remove the splice donor site located in 
the leader region. A 272 bp fragment was PCR-generated by 
using OUSD- ( 5 ' -TCTCGCTTCTGTTCGCGCGC) and OLSD- 
(5 ' "TCGATCAAGCTTGCGGCCGCGGTGGTGGGTCGGTGGTCC) as primers and 
further digested with BssHII and Hindlll. A 1008 bp 
Hindlli-xhol fragment isolated from CeB (encompassing a part 
of leader sequence and beginning MoMLV gag) and the PCR 
fragment were co-inserted into P CeB from which the 1275 bp 
BssHII-XhoI fragment (encompassing R-US-leader-gag) had been 
removed. The resulting plasmid, named pCeB DS- (Fig. 5), 
beared the deletion of splice donor (SD) site and a NotI 
restriction site created just downstream to the lost SD 



site . 
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L-TZL ' ~P«s 5 io„ plas m id s i„ which the MoMLV 

^° m °" r b r hu M „ cytomegalovirus 

««0«« propter <hCKV proper, „« derived frQm 

Notl-f.lled/EcoRZ , 250 bp ,„»« „ as isolated from b A 

and cloned into hCMV-G which had fc™ j .' . 

(further rendered blunt-ended) and" ' EaoZ^Trl^^ vsv . g 
gene. The resulting plasmid was cutted ^ clai « ™ 

r 0 rrL a T ^ fragTO£nt SnC — ^.n« derived 

flTlL'l J SV4 ° P ° lyA SeqUSnCe -"-li ff «t«I after 

filling both protruding DNA ends. The resulting pl asmid 

named phCMV-intron (Fig. 5) . had gag-pol and bar ORFs ' 

inserted between the CMV promoter and rabbit beta-globin 

polyA post-transcriptional regulatory sequences. 

An intermediate plasmid was generated by sub-cloning a 72 60 

wlthT 1 ' ragmSnt (isolat ^ from CeB DS-) into hCMVO opened 
with EcoRI . A X15S bp fragment (encompassing sequence 
derrved from LTR and SV40 polyA sequence) was removed 
from this intermediate plasmid which was then 
re-circularized by self ligation after filling both ends . 
me resulting plasmid, named phCMV+intron 2P (Fig 5 ) was 
digested with Not I and the vector was treated with klenow 
enzyme A 1440 bp fragment (encompassing hCMV promoter and 
rabbit beta-1 globin intrcn 3 (Rohrbaugh et al . , i 985 Mol 
Cell Bxol, 5: 147-iso)) was isolated from P hCMV + intron 2V by 
Notl/EcoRI digestion. This fragment was further treated with 
klenow enzyme and ligated back into the vector. The 
resulting plasmid, named hCMV+intron (Fig. 5), could express 
gag-pol and bsr genes dr.ven by the hCMV promoter and beared 
an xntrcn sequence derived from rabbit beta-i globin xntron 
B having both SD and SA (splice acceptable) sites. 

A 2450 bp fragment was removed from phCMV+intron 2P by 
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Notl/Xhol digestion. The resulting vector fragment was then 
used to co-ligate a 1330 bp fragment (containing hCMV 
promoter + 5' end of rabbit beta-l globin intron B (with SD 
site)) isolated from phCMVG by Apal - filled/Not I digestion 
and a 1 kb fragment isolated from phCMV+intron 2P by 
Notl-filled/XhoI digestion. Compared to phCMV + intron '2 P, the 
resulting plasmid, named hCMV+SD intron (Fig. 5), had the 
deletion of the 3' end of the rabbit beta-l globin intron B 
and thus no SA site in the leader region. 

Construct phCMV+leader (Fig. 5) has been described elsewhere 
(Savard et al. # unpublished). This plasmid, in which gag-pol 
and bsr genes were driven by the hCMV promoter, had the 
MoMLV SD site in the leader region. 

Gag-pol expression . 

The different constructs, including the parental CeB 
plasmid, were analysed comparatively in a complementation 
assay after transf ect ion in TEL - FBde 1 PASAF cells expressing 
4070A-MLV (amphotropic) envelope and harboring a MFGnlslacZ 
provirus. The transient production of lacZ retroviruses as 
well as the stable production of lacZ retroviral vectors 
after selection with blasticidin S were determined (Table 
5) . All the constructs were able to rescue infectious lacZ 
retroviruses indicating the expression of gag-pol proteins 
after transient transf ection . Most likely due to the 
efficient hCMV and rabbit beta-l globin intron B 
(post) -transcriptional regulatory sequences, hCMV+intron was 
particularly potent in transient retroviral vector 
production. However, 10 times less bias ticidin-resist ant 
colonies were obtained with hCMV-intron comparatively to 
CeB, and stable lacZ virus production from hCMV+intron was 
about 5-10 times lower than that of CeB. Clonal examination 
of lacZ retrovirus production from bias ticidin-resist ant 
colonies indicated that 80-90% of colonies could express 
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hxgh levels of gag-pol proteins for both hCMV +ln tron and CeB 
Plasmas, in contrast, despite varxatxon in their ability to 
form olastxcidin-resistant colonies after transection and 
despite their ability to express gag-pol proteins from 
transient transf ectants , all other constructs had a' weak 
capacity for rescuing lacZ retroviral vectors from stable 
transfectants (Table 5) . 



Comparative study of gag-pol-bsr plasmids 



gag-pol -bsr 
plasmid 


Transient 
(lacZ 
i .u./ml) 


no clones 
bsr* 


Stable 
(lacZ 
i -u. /ml 


% gag-pol 
/bsr 


Ceb 


300/ml 


50 


10 7 


90% 


Ceb DS- 


144/ml 


5 


10 s 


50% 


hCMV+ intron 
2P 


ND 


20 


10 6 


50% 


hCMV- intron 


812/ml 


0 






hCMV+SD 
intron 


15 0 /ml 


1000 


10 2 


nd 


hCMV+leader 


328/ml 


1000 


10 2 -10 3 


nd 


hCMV+ intron 


12000/ml 


5 


10 6 -10 7 


80% 



Northern blot analyses were performed on stable 
transfectants (blasticidin-resistant) obtained with some of 
the gag-pol-bsr plasmids. As expected, the results (not 
shown) displayed a correlation between expression of gag-pol 
mRNAs and gag-pol protein expression detected by rescue 
analysis (Table 5) . CeB construct was found to produce 2-3 
fold more gag-pol mRNAs compared to hCMV+ intron. 
Interestingly, an unexpected 2.45 kb RNA band was found for 
hCMV + intron construct at a ratio of 2:1 compared to the 
abundancy of the gag-pol mRNA band (at 5.95 kb) . Further 
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investigations by using other probes revealed that a cryptic 
splo.ce donnor ( SD ) site located in the gag gene (right in 
the middle of the CA coding region at position 1596-1597 
-numbering according to Shinnick et al . , 1981 Nature 
(London) 293: 543-548) was activated in this latter 
construct. The 2.45 RNA species, lacking the 3' half 6f the 
gag gene and most of the pol gene, is unlikely to give rise 
to any useful translational product. It is therefore 
interesting to notice that hCMV + intron construct was able to 
give rise to slightly more transcripts (gag-pol 5.95 mRNA + 
2 . 45 alternative RNA band) compared to gag-pol mRNA 
expressed from CeB construct. Therefore we decided to 
inactivate the cryptic SD site in the hCMV + intron construct 
in order to increase the ratio of gag-pol mRNAs . 

Assays for transfer of gag-pol functions. 

Although the supernatants of pacakaging cell lines generated 
with CeB gag-pol expression contruct were devoid of 
replication-competent retroviruses, they were found 
sporadically to transfer gag-pol genomes (example 9, Table 

4) (Cosset et al., 1995 J. Virol 69: 7430-7436). Because 
gag-pol-bsr_ constructs generated here by using the hCMV 
promoter had much less retroviral sequences homologous to 
the retroviral vector than the parental CeB construct (Fig 

5) , they are less likely to give rise to gag-pol recombinant 
(GPR) viruses. Therefore, the most efficient gag-pol-bsr 
plasmids, hCMV+intron and CeB, were further analysed for 
emergence of GPR viruses. To assay for such recombinant 
retroviruses, we attempted to mobilise an lacZ provirus from 
an indicator cell lines which could cross -complement 
potential recombinant viruses carrying gag-pol functional 
genes. Results displayed in Table 5 showed that consistently 
with data reported previously (example 9, Table 4) (Cosset et 
al., 1995 Supra), lacZ retrovirus vectors generated by using 
CeB gag-pol construct were contaminated with GPR viruses. In 



WO 97/08330 



PCT/GB96/0206I 



37 

contrast lacZ retrovirus vectors generated by using 
hCMV+intron construct were completely devoid of such GPR 
viruses, suggesting that this construct was improved 
compared to CeB with respects with emergence of recombinant 
viruses. 



Table 6, Comparative study of gag-pol-bsr r>la«smiHo 



plasmid 


input virus 
(lacZ i .u. ) a 


no of experiments 
giving titres of b 


CeB 


5x10 s 


5 


3 


0 




5x10 s 


2 


4 


2 




5xl0 4 


0 


1 


7 


hCMV+intron 


SxlO 6 


0 


0 


8 




5x10 s 


0 


0 


8 




5xl0 4 


0 


0 


8 



4X10E4 cells of TEL/MOSAF xn 24 wells were challenged with lacZ (A) of i.u. 
indicated in the table (a), and incubated at 3 7"C for 3 days. Cells were 
trypsinized and transferred into small flasJcs. Cell sup was harvested on day 5 
after lacZ (A) challenge and plated on either TE571 (not shown) and 3T3 cells 
(b) . No lacZ was mobilized into TE671 at all. LacZ (A) from CMV-int 10 again 
did not rescue lacZ from TEL/MOSAF. 



Example 12 



Generic primers to detect D-type (Medstrand and Blomberg 
J.Virol. (1993) 67:6778-6787) , C-type (Shih et al . , J 
Virol. (1989) 63:64 -75), human endogenous vims RTVL-H 
(Wilkinson et al., J. Virol. (1993) 67:2981-2989), by RT-PCR 
were employed (Patience et al w supra) . Primers to detect 
mouse endogenous VL3 0 element (Adams et al Mol . Cel . Biol . 
(1988) 8:2989-2998), and MFGnlsiacZ RNA were designed and 
synthesized (TABLE X) . Overnight supernatants (in 4ml of 
culture medium) from 106 cells of GP+EAM12 lacZ25 , FLYA4lacZ3 
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and TELCeBSFBASALF cells (Table 3) were harvested and" 
centrifuged in sucrose gradient as described previously 
(Patience et al . , J.virol., 70:2654-2657). Fractions 
containing retrovirus particles were collected, and RNA 
extracted. One twentieth of the RNA preparation or 
dilution's thereof were applied to RT-PCR as described" 
previously (Table X) . A 1/200 of RNA harvested from 
GP + EAM121acZ25 cells was positive for VL3 0 RNA. MFGnlslacZ 
RNA was found from 1/20 of RNA from GP+EAM121acZ and 
TELCeBSFBASALF cells and 1/200 of RNA from FLYA41acZ3 cells 
The primer combinations for RTVL-H, C- and D-type RNA did 
not give detectable PCR product. 

Table 7. RT-PCR detection of endogenous retrovirus RNA 
associated with virus particles. 



rt-pcr of virion associated RNA from* 

RNA primer (5' -3') GP*EAM12 FLYA4 TELCeBSF 

forward (F) /reverse (R) lacZ2S i acZ 3 BASAL F 



Ho 



MFGnls F ) CTCTGGCTCACAGTACGACG 

lacZ R) C CAT CAAT C CGG T AGGTTTT Z C 



C- type F) CARRGKTTCAARAACWSYCCCAC 

R) AGYAR VGT AGCNGGG TTHAGG 

D-type F) TCCCCTTGGAATACTCCTGTTTTYGT - 

R) CATTCCTTGTGGTAAAACTTTCCAyTG 

RTVL-K F) CCTCACCCTGATCACRYTTG NT 

R) G AATT AT G T CT G A C AG AAG GG 

VL3 0 F) GTTGACATCTGCAGAGAAAGACC + + 

R) TCTGAGGTCTGTAGACACAATGG 



NT . NT 
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genes. nou Possess the corresponding 

EXAMPLE 13 . 



Generation of aa<_r-T5~i _ _ - __ 

- - - - * ^ej-Lii r> y using TE671 

cells 



CeB, a plasma designed to over-express MoMLV gag and pel 
protexns was introduced in TE671 human rhabdomyosarcoma 
cells (ATCC CRL8805). After selection wxth blasticidin 50 
bsr-positive colonies were isolated and the RT (reverse 

TzTZTrT, activity was analysed in their 

12 TE671-CeB (TECeB) clones with high RT activity were 
selected for further analyse. The best TECeB clone, clone 
#15, had a RT activity roughly equivalent to that TELCeBS 
cells (Cosset et al . , j. virol . 69:7430-7436 (1995)- see 
also Example 7, Table 6 in this patent application) but 
dxsplayed 2-3 fold more gag-precursors ln to cells as 
demonstrated in immunoblots by using anti-CA antibodies The 
bxologxcal activity of gag-pol protexns expressed in the si, 
best TECeB clones was further confirmed by their ability to 
produce xnfectious retroviruses in a complementation assay 
A lacZ provirus was introduced into each of the TECeB clones 
by polyclonal cross-inf ectxon by using lac2(RD114) helper- 
free retrovirus vectors. FBMOSALF, a MoMLV env expression 
Plasmid (Cosset et al., j. virol. 69:6314-6322), was then 
transfected in each of the TECeB-lacZ lines and in the 
TELCeB6 cell line for comparxson. After selection with 
phleomycin, the titer of lacZ retrovxrus vectors was 
determined in the supernantant of pools of phleomycin- 
resxstant colonies for each TECEB-lacZ- FBMOSALF Ixnes. A 
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good correlation wa a found between gag-pol expression" into 
the TE-CeB clones (as determined by RT-assays and anti-gag 
immunoblots) and their ability to release infectious lacZ 
particles. TE-CeB15 cells could release approximately the 
same number of lacZ particles when compared to TELCe36 cell 
although TELCeB6 cells had the advantage of being selected 
for lacZ expression (Cosset et al., J. Virol. 69:7430-7436 
(1995)). TE-CeB15 cells were therefore used to derive 
retroviral packaging cell lines. 

Construction of env-eacpression plasmids . 

A series of plasmid (Fig. 3) was designed to allow 
expression of different retroviral envelope genes (isolated 
from MoMLV, GALV -Gibbon Ape Leukemia Virus-, and MLV-10A1) 
FBde 1PMOSAF (Fig. 3, nucleotide sequence in -Fig. 10) and 
FBdelPlOAlSAF, expressing ecotropic MoMLV or MLV-10A1 
envelopes, were generated by replacing the Bglll/Clal 
fragment from FBdelPASAF (Cosset et al., J. Virol. 69:7430- 
7436 (1995); see also Example 7, Fig. 2 and nucleotide 
sequence in Fig. 9) encompassing most of the env gene and 
splice acceptor site with that of MoMLV (position 5407 to 
7679, Shinnik et al., 1981) or with that of MLV-10A1 (Ott et 
al., J. Virol. 64:757-766 (1990)). 

Nucleotides 7514-7516 of GALV (Delassus et al., Virology 
173:205-213 (1989)) were mutated by PCR-mediated mutagenesis 
to create a Clal site (AAG to CGA) , thereby introducing a 
conservative modification (a lysine (amino-acid 665 of GALV 
env precursor) to an arginine) . The BamHI/Clal fragment (nts 
4994 (Delassus et al. Virology 173:205-213 (1989)) to 7517) 
was then sub-cloned into FBdelPASAF m which the 3glII/ClaI 
encompassing most of the env gene and splice acceptor site 
had been removed. The resulting plasmid, expressing GALV 
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envelope glycoproteins, was named FBdelPGASAF [Fig. 3, 
nucleotide sequence in Fig. 11). 

CMV10A1 was generated by inserting a Klencw enzyme- fi 1 led 
Eagl/Sall fragment from FBdelPlOAlSAF ( encompassing - 1 0A1 MLV 
env gene and phleo selectable marker) into hCMV-G digested 
with BamHI and filled with Klenow enzyme. The resulting 

n las mid WinM /r-; -> j 

------ 3- - naL.xeOLj.ue sequence in Fig. 13) 

could express 10A1 envelopes under control of the hCMV 
promoter and the phleo selectable marker by translation re- 



initiation , 



Generation of a multi-tropic set of TE671-based retroviral 
packaging lines . 

FBdelPRDSAF (Fig. 3, nucleotide sequence in Fig. 12), 
FBdel PASAF, FBdelPGASAF, FBdel PMOSAF and FBdelPlOAlSAF were 
independently introduced into cells of the TE-CeB15 pre- 
packaging line, expressing MoMLV gag-pol proteins. 
Transfected cells were phleomycin-selected and 15-20 phleo- 
resistant colonies were isolated for each env-expression 
plasmid transfected . 

Individual colonies were then analysed for expression of 
envelope glycoproteins by immunoblots on cell lysates by 
using antibodies against RD114 SU glycoproteins or against 
Rausher leukemia virus SU (to screen MoMLV, MLV-4 070A and 
MLV-10A1 env-producer clones) or against GALV. The best env- 
producer colonies as determined in this assay were further 
analysed by a complementation assay after introducing a lacZ 
retroviral vector. LacZ pseudotypes released from the 
different packaging cell lines were titrated by using NIH 
3T3 cells or TE671 cells as target. Titers higher than lxlO 7 
lacZ i.u./ml were obtained for the best clones. .Depending on 
the envelope specificities expressed in these cells, the new 
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TE671-based retroviral packaging cell lines were named - E - 
FLYE, TE-FLYA, TE-FLYRD, TE-FLY10A1, and TE-FLYGA and could 
express the MoMLV, MLV-4070A, RD114, MLV-10A1, and GALV env 
genes , respectively. 

Assays for detecting replication-competent retroviruses 
(RCRs) were performed in the supernatants of these cells and 
were negative (less than 1/ml) . 

TE671 cells are very potent for transient expression 
resulting in more than 95% of cells expressing transgene 
three days after plasmid transfection (Hatziioannou and 
Cosset, unpublished data, (1996)). The ability of retroviral 
packaging cell lines to transiently produce retroviral 
vectors is of crucial importance for gene therapy where 
vectors carrying toxic gene have to be prepared. Transient 
expression of retroviral vectors was comparatively 
determined from cells of the TE-FLYA line and from the BING 
line (Pear et al., Proc Natl Acad Sci U S A 90, 8392-6 
(1993)), a retroviral packaging cell line designed to 
transiently express retroviral vectors. Results (Table 8) 
showed that 'TE-FLYA cells were more efficient for transient 
expression of a lacZ retroviral vector hence resulting in 
higher titers. 



Table 8 . Comparative study of transient production o£ lacZ 
vectors . 



packaging 
cell line 



BING 



TE-FLYA 



cell number* 



281 



117 



% transfected 
cells b 



5.3 



35 



transient 
titer c 



2x10' 



1 . 3xl0 3 



Lel13 * = ^ Mrcnis. acz retrial vecLo„ ^ caelum PhQ3ahate 

P=ecipi:ation method and titers of of lacZ vectors !c) released in cell 
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10 



15 



20 



25 



:::i:::r::::;'rir;: *- *~~ - ~ > — - 

^ ellS ,a > <^era?e per microscope field) and the % o- 
transtected ceils fb) de termined after x . gal ^ 

Retroviral vectors prepared from TE671-based packaging c -ll 
lines were analysed for their sensitivity to human- 
complement mediated inactivation . Experiments were conducted 
as previously described (Cosset et al., J. Virol, fie?.™. 

U*9S>; see also Example 10 in this patent application) 
by using three human sera of individual donnors (Table 9, 
As expected MLV-A prepared from mouse 3T3 cells were highly 
sensitive to inactivation after 1 hr incubation witn sera 
in contrast, titers of lacZ vectors produced from TE-FLYRD 
cells were 17 to 55% of control incubations, while titers of 
lacZ vectors from TE-FLYA cells were 1 to 30% of controls. 

Table 9. Human serum sensitivity of viruses produced from 
TE671-based packaging cell lines. 



Virus from : 



3T3/A 



TE-FLYE 



TE-FLYA 



TE-FLYRD 



Three human 
BTS (AB+ 



hu56» 



<0.2, <0.2 



15, 7.8 



1, 0.6 



17, 22 



hu57* 



<C2, <0.2 



16, 11 



2.2, 7.1 



30, 44 



BTS* 



<0.2, <0.2 



48, 60 



28, 19 



54, 63 



uman fresh serum samples were testia m duplicate- r-uSS <a + i t. 

»■ (.) % control Uver age f3r nrs and optl^^ent'^ ' sho^* 
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CLAIMS : 

1. A recombinant expression vector comprising a gene of 
interest and a selectable marker gene, wherein the 
selectable marker gene is arranged downstream of the 
gene of interest and a st6p codon associated with the 
gene of interest is spaced from a start codon of said 
selectable marker gene at a distance which is 
sufficient to ensure that said selectable marker 
protein is expressed from the corresponding mKNA as a 
result of translation reinitiation. 

2. A recombinant expression vector according to claim 1 
wherein the vector is a viral vector. 

3. A recombinant expression vector according to claim 2 
wherein the vector is a retroviral vector. 

4. A recombinant expression vector according to any one 
of claims 1 to 3 wherein the gene of interest is 
included as part of a viral packaging construct:. 

5. A recombinant expression vector according to any one 
of the preceding claims wherein the number of 
nucleotides in the space between the stop codon of the 
gene of interest and the start codon of the selectable 
marker is in the range of from 20 to 200 nucleotides. 

6. A recombinant expression vector according to claim 5 
wherein the number of nucleotides in the space between 
the stop codon of the gene of interest and the start 
codon of the selectable marker is in the range of from 
60 to 80 nucleotides. 

7 . A process for producing a cell line in which a gene of 
interest is expressed, which process comprises: 

transforming host cells wich an expression vectcr 
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according to any one of the claims 1 to 6; and 

selectable those cells where expression of the 
selection marker gene may be detected. 

8- A process according to claim 7 wherein the host cell 
is a eukaryotic cell. 



9. A host cell transformed with a recombinant expression 
vector according to any one of the claims 1 to 6 

10. A retroviral packaging cell line comprising a host 
cell transformed with a first and a second recombinant 
expression vector, said first recombinant expression 
vector having a packaging- deficient construct 
comprising a viral gag-pol gene and a first selectable 
marker gene downstream thereof, and said second 
recombinant expression vector having a packaging- 
deficient construct comprising a viral env gene and a 
second selectable marker gene downstream thereof; 
wherein the start codon of the first and second 
selectable markers are spaced from the stop codons of 
the viral gag-pol gene and the viral env gene 
respectively by a distance which ensures that said 
selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation . 



A retroviral packaging cell line according to claim 10 
wherein the first selectable marker is a bsr 
selectable marker and the second selectable marker is 
a phleo selectable marker. 



A retroviral packaging cell line according to any one 
of claims 10 or 11 wherexn the packaging-deficient 
construct comprising the viral gag-pol gene and firs- 
selectable marker is the CeB (SEQ ID No 2) expression 
construct . 
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13 . 



A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the packaging-deficient 
construct comprising the viral env gene and second 
selectable marker is the FBdelPASAF (SEQ ID No 5) 
the FBdelPMOSAF (SEQ ID No 6), the Fbde 1 PGASAF (SEQ ID 
No 7), the Fbde 1 PRDSAF (SEQ ID No 8), the FbdelPXSAF 
(Fig. 3), the FbdelPlOAlSAF (Fig. 3), or the 
FBdelPVSVGSAF (Fig. 3) expression construct. 

14 . A retroviral packaging cell line according to any one 
of claims 10 or 11 wherein the recombinant expression 
vector is a packaging-deficient retroviral helper 
construct . 

15 . A retroviral packaging cell line according to claim 14 
wherein the overlapping sequences between the genomes 
of the retroviral vector and the packaging-deficient 
construct is reduced by minimizing the extent of non- 
coding retroviral sequences in the packaging-deficient 
genome . 

16. A retroviral packaging cell line according to any one 
of claims 10 to 15 wherein the viral gag-pol gene and 
the selectable marker are expressed under the control 
of a non-retroviral promoter. 

17 . A retroviral packaging cell line according to claim 16 
wherein the promoter is fused to rabbit beta-1 globin 
intron. 

13. A retroviral packaging cell line according to claim 16 
or claim 17 wherein the promoter is a hCMV promoter. 



19 



A retroviral packaging cell line according to any one 
of claims 16 to claim 13 wherein the viral gag-pol 
gene and the selectable marker is a hCMV + intron (SEQ 
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ID No3) or a hCMV+innronkaSD {SEQ ID No 4 ) expression 
construct . 

20. A retroviral packaging cell line according Co. anyone 
of claims 10 to 15 wherein the viral env gene and the 
selectable marker are under the control of a non- 
ret rovir= 1 nv^m^^sr- 

21. A retroviral packaging cell line according to claim 20 
wherein the promoter is fused to rabbit beta-l globin 
intron . 



A retroviral packaging cell line according to claim 20 
or claim 21 wherein the promoter is a hCMV promoter. 

A retroviral packaging cell line according any one of 
claims 20 to 22 wherein the vxral env gene and the 
selectable marker is a CMV10A1 (SEQ ID No 9) 
expression construct. 



A retroviral packaging ceil line according to any one 
of claims 10 to 23 wherein the cell line is the HT1080 
line, the TE671 line, the 3T3 line, the 293 line or 
the MV-1-1U line. 



25. A retroviral packaging cell line according to anyone 
of claims 10 to 24 wherein the retroviral packaging 
cells comprises human HT1080 cells and express RD114 
envelopes . 



A retroviral packaging cell line according to anyone 
of claims 10 to 24 wherein the retroviral packaging 
cells comprises human TE671 cells and express RD114 
envelopes . 
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27. A process for producing a retroviral packaging cell 
line in which a gene of interest in expressed, which 
process comprises : 

transforming host cells with a first and a second 
recombinant expression vector, said first recombinant 
expression vector having a packaging-deficient 
construct comprising a viral gag-pol gene and a first 
selectable marker gene downstream thereof, and said 
second recombinant expression vector having a 
packaging-deficient construct comprising a viral env 
gene and a second selectable marker gene downstream 
thereof; wherein the start codon of the first and 
second selectable markers are spaced from the stop 
codons of the viral gag-pol gene and the viral env 
gene respectively by a distance which ensures that 
said selectable marker protein is expressed from the 
corresponding mRNA as a result of translation 
reinitiation; and 

selecting transformed cells which express said 
first and/or second marker genes. 

28. A packaging deficient construct for use in a process 
according to claim 27. which expresses a viral gag-pol 
gene and a selectable marker wherein a start codon of 
the selectable marker is spaced from a stop codon of 
the viral gag-pol gene by a distance which ensures 
that said selectable marker protein is expressed from 
the corresponding mRNA as a result of translation 
reinitiation . 

29. A packaging deficient: construct for use in a process 
according tc claim 27, which expresses a viral env 
gene and a selectable marker gene,- wherein a start 
codon of the selectable marker is spaced from a stop 
codon cf the viral env gene by a distance which 
ensures that said selectable marker protein is 
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expressed from the corresponding mRNA as a result of 
translation reinitiation. 
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.de) psi 



MLV-LTR 



5D 



7 ^7 ^ 


7 1 


r 






Mo MLV GAG-POL | 


BSR 








pcly A 


I ; 













SA 



pel gene.. TCT AG A CTG AC A T3 G CG'C 
GTT CAA CGC TCT CAA AAC CCC TTA 
AAA ATA AGG TTA ACC CGC GAG GCC 
CCC TAA 

tccccnaattcttctcatgctcacaggggtcagtac 
tgcncgcccgcaccaqtqc oqcccaGccoq ccaf^ 

AAA ACA TTT AAC ATT TCT... bsrgene 



Figure 1. Schematic structure of CeB expression vector 
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.del psi 
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poly A 



SA 



-i 1 



e/7i/£ene...TAC GAG CCA TAG 
gqcqcc tagtgttgacaattaatcatcggcatagtata 
cggcatagtataatacgactcactataggagggccacc 
A7BGCC AAG TTG ACC.phleo gene 



Figure 2. Schematic structure of FBdelPASF expression 
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F3de!PVSVGSAF Kzs\ 
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Figure 3. Schematic structure of env expression vectors 
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NG AGCTC AGGAC AGGTAG AAAG AATG AATAG AAC AATAAAAGAG ACCCTTACT AAATTr a c n 

CCTTAGAGACTGGCTTAAAAGATTGGAGACGCCTCCTATCTCTGGCTTTGTTAAGAGCCA f?n 
G AAATACGCCC AACCGTTTTCGGCTCACCCC AT ATG AAATCCTTTATGGGGG ^CCCCcP 7 1 n 

CTTTGTCAACCTTGCTCAATTCCTTCTCCCCCTCCGATCCTAAGACTGATTJicA^Sr pfS 
GACTAAAA.GGGCTGCAAGGCGTGCAGGCCCAAATCTGGACACCCCTGGCCGA^TTGTACr ?™ 
GGCC AGGAC ATCCACAAACTAGCCACCCATTTCAGGTGGGAGACTCCGTGTACGTCCGCr 1 2n 
GGCACCGCTCTCAAGGATTGGAGCCTCGTTGGAAGGGACCTTACATCGTCCTGCTGACCA Jn 
CGCCCACCGCCATAAAGGTTGACGGGATCGCCGCCTGGATTCACGC\TCGC\CGCCAACr ISn 
C AGCCCCAAAAAC CCCTGGACCAGAAACTCCCAAAACCTGGAAGCTCCGCCGTTCr^I i , 2 
ACCCTCTTAAGAT AAGACTCTCCCGTRTTTr, ACTCCTAATrr^rc^^.-.?-! 5S?S*? A 540 



AATCAGCTCCTACAGTCC^CCTTGTAGGGGCT "20 

ACAGCCCCCATCCATATCTCCGATGGTGGAGGACCCCTCGATACTAAGAGAGTCTO^ 1140 
GTCCAAAAAAGGCTAGAACAAATTCATAAGGCTATGACTCCTGAACTTCAATACCACCCC 1?SS 
TTAGCCCTGCCCAAAGTCAGAGATGACCTTAGCCTTGATGCACGCSACTm 12sS 

tgtttaaaactaggtacccctacccctcttgcgatacccactccctctttaacSactc? 1 1 r n 
ctagcagactccctagcgaatgcctcctgtcagattatacctcccctcttgcttca^S 144? 
atgcagttctccaactcgtcctgtttatcttcccctttcatta^cgatacSaacaIS?? liio 
gag ^ aggtgcagtcac ctttactaactgcacctctgtagccaatgtcagt^cm lleo 
tgtgccctaaacgggtcagtcttcctctgtggaaataacatggcatacacctatttaccc 1620 
caaaactggaccagactttgcgtccaagcctccctcctccccgacattgacaJcaac^cc^ les? 
ggggatgagccagtccccattcctgccattgatcattatatacatagaccta^c^aS "S§- 

GTACAGTTCATCCCTTTACTAGCTGGACTGGGAATCACCGCAGC ATTCACCACCGGAGCT 1800 

acaggcctaggtgtctccgtcacccagtatacaaj^ttatcccatcagttaatatctgS iqso 
gtccaagtcttatccggtaccatacaagatttacaagaccaggtagactcgttagctgaa ~ 9 2 0 

GTAGTTCTCCAAAATAGGAGGGGACTGGACCTACTAACGGCAGAACAAGGAGGAATTTGT 19 80 
TTAGCCTTACAAGAAAAATGCTGTTTTTATGCTAAC AAGTCAGGAATTGTGAGAAACAAA 2040 
ATAAGAACCCTAC AAGAAGAATTACAAAAACGCAGGGAAAGCCTGGCAACCAACCCTCTC 2100 
TGGACCGGGCTGC AGGGCTTTCTTCCGTACCTCCTACCTCTCCTGGGACCCCTACTCACC 2160 
CTCCTACTCATACTAACCATTGGGCCATGCGTTTTCAGTCGCCTCATGGCCTTCATTAAT 2 220 
GATAGACTTAATGTTGTACATGCCATGGTGCTGGCCCAGCAATACCAAGCACTCAAAGCT 22 80 

GAGGAAGAAGCTC AGGATTGAGCTTCCGGGAC AAAAGC AGGGGGGAATGAGAAGTCAGAA 2 3 40 

CCCCCCACCTTTnrT&raTaaiTajirrrrTTTrAT^rr^mm^m^m,, > . _ . ^ _ 



jTTCGCAACCCGGGCTCCGr 
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AATGAAAGAC 
ATGGAAAAAT 
AGCTGAATAT 
AAGAACAGAT 
CCCCGGCTCA 
AGAGAACCAT 
TGAACTAACC 
ATAAAAGAGC 
GGTACCCGTG 
CTTGGGAGGG 
CTCGTCCGGG 
TGGAAGCTTC 
TGAGAATATG 
TGTCGAGCGG 
CTGCTCTGCA 
AGACCTCATC 
CCAGGTCCCC 
GCCCTTTGTA 
TGAACCTCCT 
AGGCGCCAAA 
TACAGAAGAC 
TGGTGGAGAA 
ACGTGGGAGA 
CGCAGGAGGA 
GAAAAATAAT 
TGTTCTCATC 
GACCGGAGAA 
TGGGCGCCCC 
CTGGGATTAC 
AGCGGGTCTC 
ACAAGGGCCC 
GTACACTCCT 
TTGGCAGTCT 
GACGCTTGGA 
AGAAAGAGAG 
GGATGAGCAG 
GGC CACTGTC 
ACTCGATCGC 
CAAGAAACCA 
CTAGGGAGGT 
GCAACCCGTC 
TGGACCCCTA 
CTGGACCACG 
TGTACCAGAC 
AATCCACTTT 
GTTGACCCTA 
TTCTCTAGGG 
GGGACTGGCA 
GTCCATAAAA 
GAGACTGTTG 
ACCCGTTAAG 
CAAGCGGGTG 
CCCACCGTCC 
ACTCCACCCC 
CTCAGGACAA 
TGATGAGGCA 
GCTACAGTAC 
TACTCGGGCC 
CCAAATTTGC 
GCTGACTGAG 
ACTAAGGGAG 
AATGGCAGCC 
CCAACAAAAG 
GCCAGATTTG 
TGTCCTAACG 
AGACCCAGTA 
GACAAAGGAT 
AGTAGAGGCA 



CCCACCTGTA 
ACATAACTGA 
GGGCCAAACA 
GGAACAGCTG 
GGGCCAAGAA 
CAGATGTTTC 
AATCAGTTCG 
CCACAACCCC 
TATCCAATAA 
TCTCCTCTGA 
ATCGGGAGAC 
TGCAGCATCG 
GGCCAGACTG 
ATCGCTCACA 
GAATGGCCAA 
ACCCAGGTTA 
TACATCGTGA 
CACCCTAAGC 
CGTTCGACCC 
CCTAAACCTC 
CCCCCGCCTT 
GCGACCCCTG 
CGGGAGCCCC 
AACGGACAGC 
AACCCTTCTT 
ACCCATCAGC 
GAAAAACAAC 
ACTCAACTGC 
ACCACCCAGG 
CAAAACGCGG 
AATGAGTCTC 
TATGACCCTG 
GCCCCAGACA 
GATTTGGTTA 
GAACGTATCA 
AAAGAGAAAG 
GTTAGTGGAC 
GACCAGTGTG 
CGAGGACCTC 
CAGGGTCAGG 
ACCTTCCTGG 
AGTGATAAGT 
GATCGCAAAG 
TGTCCCTATC 
GAGGGATCAG 
AATATAGAAG 
T CCA CAT GGC 
GTTCGCCAAG 
CAATACCCCA 
GACCAGGGAA 
AAACCAGGGA 
GAAGACATCC 
CACCAGTGGT 
ACCAGTCAGC 
TTGACCTGGA 
CTGCACAGAG 
GTGGATGACT 
CTGTTACAAA 
CAGAAACAGG 
GCCAGAAAAG 
TTCCTAGGGA 
CCCTTGTACC 
GCCTATCAAG 
ACTAAGCCCT 
CAAAAACTGG 
GCAGCTGGGT 
GCAGGCAAGC 
CTAGTCAAAC 



GGTTTGGCAA 
GAATAGAGAA 
C-GATATCTGT 
AATATGGGCC 
CAGATGGTCC 
CAGGGTGCCC 
CTTCTCGCTT 
TCACTCGGGG 
ACCCTCTTGC 
GTGATTGACT 
CCCTGCCCAG 

rp rp f^«p Q TG T T 0 

TTACCACTCC 
ACCAGTCGGT 
CCTTTAACGT 
AGATCAAGGT 
CCTGGGAAGC 
CTCCGCCTCC 
CGCCTCGATC 
AAGTTCTTTC 
ATAGGGACCC 
CGGGAGAGGC 
CTGTGGCCGA 
TTCAATACTG 
TTTCTGAAGA 
CCACCTGGGA 
GGGTGCTCTT 
CCAATGAAGT 
CAGGTAGGAA 
GCAGAAGCCC 
CCTCGGCCTT 
AGGACCCAGG 
TTGGGAGAAA 
GAGAGGCAGA 
GGAGAGAAAC 
AAAGAGATCG 
AGAAACAGGA 
CCTACTGCAA 
GGGGACCAAG 
AGCCCCCCCC 
TAGATACTGG 
CTGCCTGGGT 
TACATCTAGC 
CTCTGTTAGG 
GAGCTCAGGT 
ATGAGCATCG 
TGTCTGATTT 
CTCCTCTGAT 
TGTCACAAGA 
TACTGGT AC C 
CTAATGATTA 
ACCCCACCGT 
ACACTGTGCT 
CTCTCTTCGC 
CCAGACTCCC 
ACCTAGCAGA 
TACTGCTGGC 
CCCTAGGGAA 
TCAAGTATCT 
AGACTGTGAT 
CGGCAGGCTT 
CTCTCACCAA 
AAATCAAGCA 
TTGAACTCTT 
GACCTTGGCG 
GGCCCCCTTG 
TAACCATGGG 
AACCCCCCGA 



GCTAGCTTAA 
GTTCAGATCA 
GGTAAGCAGT 
AAACAGGATA 
CCAGATGCGG 
CAAGGACCTG 
CTGTTCGCGC 
CGCCAGTCCT 
AGTTGCATCC 
ACCCGTCAGC 
GGACCACCGA 

CTTAAGTTTG 
AGATGTCAAG 
CGGATGGCCG 
CTTTTCACCT 
CTTGGCTTTT 
TCTTCCTCCA 
CTCCCTTTAT 
TGACAGTGGG 
AAGACCACCC 
ACCGGACCCC 
CTCCACTACC 
GCCGTTCTCC 
TC C AGGT AAA 
CGACTGTCAG 
AGAGGCTAGA 
CGATGCCGCT 
CCACCTAGTC 
CAC CAATTTG 
CCTAGAGAGA 
GCAAGAAACT 
GTTAGAGAGG 
AAAGATCTTT 
AGAGGAAAAA 
TAGGAGACAT 
TAGACAGGGA 
AGAAAAGGGG 
ACCCCAGACC 
TGAACCCAGG 
GGCCCAACAC 
CCAAGGGGCT 
TACCGGTAAG 
AAGAGATTTG 
TATGGGACCA 
GCTACATGAG 
TCCTCAGGCC 
CAT AC CTC T G 
AGCCAGACTG 
CTGCCAGTCC 
TAGGCCTGTC 
GCCCAACCCT 
TGATTTAAAG 
CTTTGAGTGG 
AC AGGGTTT C 
CTTCCGGATC 
CGCCACTTCT 
CCTCGGGTAT 
GGGGTATCTT 
GGGGCAGCCT 
CTGTCGCCTC 
AACGGGGACT 
AGCTCTTCTA 
TGTCGACGAG 
TC GGC CGGTG 
CCTACGGATG 
ACAGCCACTA 
CCGCTGGCTT 



GTAACGC CAT 
AGGTCAGGAA 
TCCTGCCCCG 
TCTGTGGTAA 
TCCAGCCCTC 
AAATGACCCT 
GCTTCTGCTC 
C CG ATTG ACT 
GACTTGTGGT 
GGGGGTCTTT 
CCCACCACCG 

ACCTTAGGTC 
AAGAGACGTT 
C GAG AC GGC A 
GGCCCGCATG 
GACCCCCCTC 
TCCGCCCCGT 
CCAGCCCTCA 
GGGCCGCTCA 
CCTTCCGACA 
TCCCCAATGG 
TCGCAGGCAT 
TCTTCTGACC 
CTGACAGCTC 
CAGCTGTTGG 
AAGGCGGTGC 
TTTCCCCTCG 
CACTATCGCC 
GCCAAGGTAA 
CTTAAGGAAG 
AATGTGTCTA 
TTAGAAGATT 
AATAAACGAG 
GAAGAACGCC 
AGAGAGATGA 
GGAGAACGAA 
CACTGGGCTA 
TCCCTCCTGA 
ATAACCCTCA 
TCCGTGCTGA 
ACTGGAGGAA 
GTCACCCACT 
CTGACTAAAC 
ATGGGGCAGC 
ACCTCAAAAG 
TGGGCGGAAA 
AAAGCAACCT 
GGGATCAAGC 
CCCTGGAACA 
CAGGATCTGA 
TACAACCTCT 
GATGCCTTTT 
AG AG ATC CAG 
AAAAACAGTC 
CAGCACCCAG 
GAGCTAGACT 
CGGGCCTCGG 
CTAAAAGAGG 
ACTCCGAAGA 
TGGATCCCTG 
CTGTTTAATT 
ACTGCCCCAG 
AAGCAGGGCT 
GCCTACCTGT 
GTAGCAGCCA 
GTCATTCTGG 
TCCAACGCCC 



TTTGCAAGGC 

CAGATGGAAC 

GCTCAGGGCC 

GCAGTTCCTG 

AGCAGTTTCT 

GTGCCTTATT 

CCCGAGCTCA 

GAGTCGCCCG • 

CTCGCTGTTC 

CATTTGGGGG 

GGAGGTAAGC 

ACTGGAAAGA 
GGGTTACCTT 
CCTTTAACCG 
GACACCCAGA 
CCTGGGTCAA 
CTCTCCCCCT 
CTCCTTCTCT 
TC G ACCT ACT 
GGGACGGAAA 
CATCTCGCCT 
TCCCCCTCCG 
TTTACAACTG 
TGATCGAGTC 
GGACTCTGCT 
GGGGCGATGA 
AGCGCCCAGA 
AG TTGCTCCT 
AAGGAATAAC 
C CT ATC G CAG 
TGTCTTTCAT 
TAAAAAACAA 
AAACCC CGGA 
GTAGGAGAGA 
GCAAGCTATT 
GGAGGTCCCA 
AAGATTGTCC 
CCCTAGATGA 
AAGTCGGGGG 
CCCAAAATCC 
AGCGGTATCG 
CTTTCCTCCA 
TAAAAGCCCA 
CCCTGCAAGT 
AGC CAG ATGT 
CCGGGGGCAT 
CTACCCCCGT 
CCCACATACA 
CGCCCCTGCT 
GAGAAGTCAA 
TGAGCGGGCT 
TCTGCCTGAG 
AGATGGGAAT 
CCACCCTGTT 
ACTTGATCCT 
GCCAACAAGG 
CCAAGAAAGC 
G TC AG AG ATG 
CCCCTCGACA 
GGTTTGCAGA 
GGGGCCCAGA 
CCCTGGGGTT 
ACGCCAAAGG 
CCAAAAAGCT 
TTGCCGTACT 
CCCCCCATGC 
GGATGACTCA 
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^ A J GAGGGG TTGCTTTTGG ACACGGACCG GGTCCAGTTC GGACCGGTGG TAGCCCTGAA A1 , n 

CTGCTCCCAC TGCCTGAGGA AGGGCTGCAA GACAACTGCC TTGATATCCT - 4?nn 
GGCCGAAGCG CACGGAACCC GACCCGACCT AACGGACCAG CCGCTCCCAG ACGCCGAcS locn 
CACCTGGTAC ACGGATGGAA GCAGTCTCTT ACAAGAGGGA CAGCGTAAGG CGGGAGCTGC 2?™ 
™^"f C GAGAGG GAGG TAATCTGGGC TAAAGCCCTG CCAGCCGGGA 4^n 
GCGGGCTGAA CTGATAGGAC TCACCCAGGC CCTAAAGATG GCAGAAGGTA ^GAAGCTaS 

I*^^ T GATAGCCGTT atgctittgc tactgccgat atccatggag aKSSSS 

^^ST 000 TTCG TCACAT CAGAAGGCAA AGAGATCAAA AATAAAGACG AGATCTTGG^ 
CCTACTAAAA GCCGTCTTTC TGCCCAAAAG ACTTAGCATA ATCCATTGTC caggacatca* 
AAAGGGACAC AGCGCCGAGG CTAGAGGGAA CCGGATGGCT GACCAAGCGG CCCGAAAGGr 
AGCCATCACA GAGACTCCAG ACACCTCTAC CCTCCTCATA GAAAATTCAT caccctacac 
CTCAGAACAT TTTCATTAGA CAGTGACTGA TATAAAGGAC CTAACCAAGT TGGGGGCCAT 
TTATGATAAA ACAAAGAAGT ATTGGGTCTA CCAAGGAAAA CCTGTGATGC CTGACCAGTT 

TTATTAGACT TTCTTCATCA GCTGACTCAC CTCAGCTTCT CAAAAATgIa 
GGCTCTCCTA GAGAGAAGCC ACAGTCCCTA CTACATGCTG AACCGGGATC GAACACTCAA 
AAATATCACT GAGACCTGCA AAGCTTGTGC ACAAGTCAAC GCCAGCAAGT CTGCCGTTAA 
r^f^t^l AGGGTCCGCG GGCATCGGCC CGGCACTCAT TGGGAGATCG ATTTCACCGA 
GATAAAGCCC GGATTGTATG GCTATAAATA TCTTCTAGTT TTTATAGATA CCTTTTrTnr 

GCCTTCGG AA CCAAGAAAGA AACCGCCAAG GTCGTAACCA aSI^aCT 
A ^S2£ GA * G TTCCCCAGGT TCGGCATGCC TCAGGTATTG GGAACTGACA ATGGGCCtS 

aaggtgagtc agacagtggc cgatctgttg ggoattgatt GGAAAT^aS 

2T^2S A ? AC AGACCCCAAA GCTCAGGCCA GGTAGAAAGA ATGAATAGAA CCATCAAGGA 
AAATTAACGC TTGCAACTGG CTCTAGAGAC TGGGTGCTCC TACTCCCCTT 
AGCCCTGTAC CGAGCCCGCA ACACGCCGGG CCCCCATGGC CTCACCCCAT ATCAGATCTT 
A T A I^ CA CCCCCGCCCC TTGTAAACTT CCCTGACCCT GACATGACAA GaSSSSE 
CAGCCCCTCT CTCCAAGCTC ACTTACAGGC TCTCTACTTA GTCCAGCACG AAGTCTGGAG 
A ^^ G J GGGG GCAGCCTACC AAGAACAACT GGACCGACCG GTGGTACCTC JSSSSS 
tS™^ C ACAGTGTGGG TCCGCCGACA CCAGACTAAG AACCTAGAAC CTCGCTGGAA 
5S^S?^ C ACAGTCCTGC TGACCACCCC CACCGCCCTC AAAGTAGACG GCATCGCAGC 
TTGGATACAC GCCGCCCACG TGAAGGCTGC CGACCCCGGG GGT GGAC CAT CCTCTAGACT 
™^S GC GTTCAACGCT CTCAAAACCC CTTAAAAATA AGGTTAACCC GCGAGGCCCC 
SI^1 GGCCT TAATTCTTCT GATGCTCAGA GGGGTCAGTA CTGCTTCGCC CGGCTCCAGT 
GCGGCCCAGC CGGCC AC CAT GAAAACATTT AACATTTCTC AACAAGATCT AGAATTAGTA 
CAGAGAAGAT TACAATGCTT TATGAGGATA ATAAACATCA TGTGGGAGCG 
CGAAAACAGG AGAAAT C ATT TCGGCAGTAC ATATTGAAGC GTATATAGGA 
?t^E^^° TTTGTGCAGA AGC CATTGC G ATTCGTAGTG CAGTTTCGAA TGGACAAAAG 
^7H1^ ACA CGATTGTAGC TGTTAGACAC CCTTATTCTG AC G AAGT AG A TAGAAGTATT 
^2£ G S^ GTCCTTGTGG TATGTGTAGG GAGTTGATTT CAGACTATGC AC CAG ATTGT 
TTTGTGTTAA TAGAAATGAA TGGCAAGTTA GTCAAAACTA CGATTGAAGA ACTCATTCCA 
CTCAAATATA CCCGAAATTA AAAGTTTTAC CACCAAGCTT ATCGATTAGT CCAATTTGTT 
AAAGACAGGA TATCAGTGGT C CAGGCTC T A GTTTTGACTC AACAATATCA CCAGCTGAAG 
2=TaS^?H AC G AGCC AT A GATAAAATAA AAGATTTTAT TTAGTCTCCA GAAAAAGGGG 6600 
GGAATGAAAG ACCCCACCTG TAGGTTTGGC AAGCTAGCTT AAGTAACGCC ATTTTGCAAG 6660 
GCATGGAAAA ATACATAACT GAGAATAGAG AAGTTCAGAT CAAGGTCAGG AACAGATGGA 672 0 

ACAGTCGAGA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA 
AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 
AATGTATCTT ATCATGTCTG GATCCCCAGG AAGCTCCTCT GTGTCCTCAT AAACCCTAAC 
CTCCTCTACT TGAGAGGACA TTCCAATCAT AGGCTGCCCA TCCACCCTCT GTGTCCTCCT 
G ;^£7 AGG tcacttaaca AAAAGGAAAT TGGGTAGGGG TTTTTCACAG ACCGCTTTCT 
AAGGGTAATT TTAAAATATC TGGGAAGTCC CTTCCACTGC TGTGTTCCAG AAGTGTTGGT 
AAACAGCCCA CAAATGTCAA CAGCAGAAAC ATACAAGCTG TCAGCTTTGC ACAAGGGCCC 7140 
^ AG ^EZ GG ATC AAGAA GCAC TGTGGT TCCTCTGTTA GTAATGTGCA AAACAGGAGG 72 00 

CACATTTTCC CCACCTGTGT AGGTT C C AAA ATATCTAGTG TTTTCATTTT TACTTGGATC 72 60 

A ^ GAACCCAG CACTCCACTG GATAAGCATT ATCCTTATCC AAAACAGCCT TGTGGTCAGT 73 2 0 

G.TCATCTGC TGACTGTCAA CTGTAGCATT TTTTGGGGTT ACAGTTTGAG CAGGATATTT 73 8 0 

GGTCCTGTAG TTTGCTAACA CACCCTGCAG CTCCAAAGGT TCCCCACCAA CAGCAAAAAA 744 0 

A.GAAAATTT GACCCTTGAA TGGGTTTTCC AGCAC CAT TT TCATGAGTTT TTTGTGTCCC 75 00 

TGAATGCAAG TTTAACATAG CAGTTACCCC AATAACCTCA GTTTTAACAG T\ACAGCT~C 75 6 0 

CCACATCAAA ATATTTCCAC AGGTT AAGTC CTCATTTAAA TTAGGCAAAG GAATTC 7616 
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Figure 7. hCMV+intron Sequence i 

i£S£S? SSS^C CGCATAOTA ■ 50 

TAAGCTACAA CAACgSIS SSSSSac S?£££gA SACCAAAATT if J 

esse ss sssi iEi i i as 

= 5SSS5SJ ?S££5 IsE ilii ™- 3." 

TGGGTGGACT ATTTACGGTA AACTGCCcIc T^^T^ ^TH CCA TCACGTCAA 4 20 

AGTACGCCCC CTATTGACGT £S2Sa ^ ATATGC ^" 480 

ATGACCTTAT GGGACTTTCC TAc£?£S£ ^A^ACG tSSctcI^ C^a^ 0 540 
S2™2S* SI^AAT GGGCGTGGAT AGCGGTTTGA SSSg ' ^ 

GACTTTCCAA AATGTCGtXa CAACTCCGCC C^£gac£ JJSSSES* ^TCAACGG 720 

CGGTGGGAGG TCTATATAAG SSScTC GA^AA^^ ""^S^™ ^0 

GCTTATCGAA ATGTCGACTG AGAACTTCAG GGTGAGTTTC rj^*^™ TG CTTAACTG 840 

CTTTTTCGCT ATTGTAAAAT TCATGTTATA TGGAGGGGGC T^r^^ ATTGTTCTTT 900 

TAGAATGGGA AGATGTCCCT TGTATCACCA TGGACCCTCA GGGTGTTGTT 960 

CTTTCTACTC TGTTGACAAC CATTCTCTCC TCTTATTTTC S^*^ GTT-TCTTTCA 1020 

TTCGTTAAAC TTTAGCTTGC ATTTCTAACG AM^IIf I^TSf^" CTGTAACTTT 1080 

AGATTGTAAG TACTTTCTCT AATCACTTTT ScaI^ USSS? ^ rTATTTGTC 1140 

TACTTCAGCA CAGTTTTAGA g1£caS£ S^Sa r^^P 3 1200 

GCATATAAAT TCTGGCTGGC GTGGAAATAT TCTTATTGGT T^l^ft^ GAATATTTCT 1260 

CATCATCCTG CCTTTCTCTT TATGGTTACA SgItaTa^I ^^E^rl ACATCCTGGT 1320 

ATACTCTGAG TCCAAACCGG GCCCCTOTGC TAACCATGTT ^S^I^ 1380 

ACAGCTCCTG GGCAACGTGC TC^TTGTTGT 25S£?SS SSSSS IcIIXS^ 1440 

GCAAGCTTCT GCAGCATCGT TCTGTGTTGT CTOTTTrrra o^JZI^SS^ AGA ATTGGCC 15 00 

GAGAATATGG GCCAGACra SSEgg ££25^ S£E£g C^Hr^ 1560 

^GC^TG^G ISSSfJ* CCAG *CGGTA GATGTCAAGA AGAGACGTTG SggJSg 

Iv-TGCAG AATGGCCAAC CTTTAACGTr r^iTr^rrrr wiialliit. 1680 

GACCTCATCA CCCAGGTTAA GA^CA^C G^CCGCA^ aTI^^ 1740 

CAGGTCCCCT ACATCGTGAC CTGGGAAGCC TTCGCTTTTG AOcSrc£5 ^S^S^ 1800 
CCCTTTGTAC ACCCTAAGCC TCCGCCTCCT CTTCCTCCAT C^CCCG^ ^S^^S 
^GC^SIr ^ GACCCC SCCTCGATCC t£££££ CA^CCT^C TCCTTCTCTA 
^AAACCTCA AGTTCTTTCT GACAGTGGGG GGCCGCTCAT CGACCTaSt 
cS5S£i^ GG CCGCCTTA TAGGGACCCA AGACCACCCC CTTCCGACAG GGACGGAAAT 
S^A^tr ™^ TGC GCGASAGGCA CCGGACCCCT CCCCAATGGC ATCTCGCCTA 
gSS^** g 5S agccccc tgtggccgac TCCACTACCT CGCAGGCATT CCCCCTCCG^ 
aa^a™™ ^GGACAGCT TCAATACTGG CCGTTCTCCT CTTCTGACCT TTACAACTGG 
G^CA^S Cr'SSZr'S ^J^ AAGAT «»«™AAC TGACAGCTCT gI^gIS^ 
arrS-^T^ CCCATCAGCC CACCTGGGAC GACTGTCAGC AGCTGTTGGG GACTCTGOTG 
gggcgcccca AAAAAC ^S° GGTGCTCTTA GAGGCTAGAA AGGCGGTGCG §£?SS5? 

CA C TCAACTGCC CAATGAAGTC GATGCCGCTT TTCCCCTCGA r£r£r£i^Y,~ 

gcgggtctcc aggtaggaac cacctSSS ISa^gcS g^c^cta 

caaggS?™ A ^ AAGG 5SSS cagaagcccc accaatttgg ccaaggtaaa aggaataaca 

^li i™ SSSS SSESS SSJggffi ™1 
H™ H™ S£££; 32Sg& ISSS 

GAGAGAAACA GAGGAAAAAG AAGAACGCCG TAGGACAGAG 
rrSr^ AAGAGAAAGA AAGAGATCGT AGGAGACATA GAGAGATGAG CAAGCTATTG 
CTCGATCGCG GAAACAGG ^T AGACAGGGAG GAGAACGAAG S^CC^ 

aI^T^S ACCAGTGTGC CTACTGCAAA GAAAAGGGGC ACTGGGCTAA AGATTGTCrr 
GAGGACCTCG GGGAC CAAG A CCCCAGACCT CCCTCCTGAC CctI^GAC 

c^^^o? ^ G 2S TCAGGA gcccccccct GAACCCAGGA taaccctcaa agtcgg^ 

CCTTCCTGGT AGATACTGGG GCCCAACACT CCGTGCTGAC cS^^CT 
T^AcSc^ A?CG^rT I^^ 0 CAAGOOQCTA CTGGAGGAA^ G^^CGC 
rrlrr^^ ATCGCAAAGT ACATCTAGCT ACCGGTAAGG TCACCCACTC TTTCCTCCAT 
A^rr^S^ GTCCCTATCC TCTGTTAGGA AGAGATTTGC TGACTAAACT AAAAGCCC^I 
T^^™ G AGGGAT <=AGG AG C TC AGGTT ATGGGACCAA TGGGGCAGCC CCTGCAAGTG 
TCTCTAGGGT cJtrt^^ TGAGCATCGG CTACATGAGA CCTCA^XgA GC^^GCT 
^?^2S? T CCACATGGCT GTCTGATTTT CCTCAGGCCT GGGCGGAAAC CGGGGGCATG 
GGACTGGCAG TTCGCCAAGC TCCTCTGATC ATACCTCTGA aagc^acctc -ACC^^ 
Ir^Z^Z AATACCCCAT GTCACAAGAA GCCAGACTGG GGAT^AgJc CCA^™ 
rrn^TIS? ACCAGGCA AT ACTGGTACCC TGCCAGTCCC C CTGG AACAC GCC^GCTA 
CCCGTTAAGA AACCAGGGAC TAATGATTAT AGGCCTGTCC AGGATCTGAr S^rS 
^CATCCA CCCCACCGTG C^^CTT ^^^C 
ACCAGTGGTA CACTGTGCTT GATTTAAAGG ATGCCTTTTT S^C^AGA 
CTCCACCCCA CCAGTCAGCC TCTCTTCGCC TTTGAGTGGA GAGATCCa" ^G^TC 
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J GA ^ GAAT TGACCTGGAC CAGACTCCCA CAGGGTTTCA AAAACAGTCC CACCCTGTT- 
^^ AC TGC ACAGAGA CCTAGCAGAC TTCCGGATCC AGCACCCAGA CTTGATCCTG 
^^^^ ACG TGGAT ^ACTT ACTGCTGGCC GCCACTTCTG AGCTAGACTG C^£2£S 
cSI^GCC IcI^^ CCTAGGGAAC CTCGGGTATC GGGCCTCG^C ££££££ 4320 
CAAATTTGCC AGAAACAGGT CAAGTATCTG GGGTATCTTC T AAAAG ^GGG TCAGAGATrcr TtoX 
CTGACTGAGG GC AG AAAAG A GACTGTGATG GGGCAGCCTA CTCCGA^GAC CCCTCGAGAA 
CTAAGGGAGT TCCTAGGGAC GGCAGGCTTC TGTCGCCTCT GGATCCCTGG GTWgSgaI 
ATGGCAGCCC CCTTGTACCC TCTCACCAAA ACGGGGACTC TGTTTAATTG ^cSga£ * 

CAACAAAAGG cctatcaaga AATCAAGCAA gctcttctaa ctgccccagc C^^^c 

CCAGATTTGA CTAAGCCCTT TGAACTCTTT GTCGACGAGA AGCAGGGCTA CGCCA^SS - Itln 
GTCCTAACGC AAAAACTGGG ACCTTGGCGT CGGCCGGTGG cctacctgtc caaaaa^otI 
GACCCAGTAG CAGCTGGGTG GCCCCCTTGC CTACGGATGG TAGCAGCCAT TGCCGTACTC 
ACAAAGGATG CAGGCAAGCT AACCATGGGA CAGCCACTAG TCATTCTGGC CCCCCATnra 
TAGTCAAACA ACCCCCCGAC CGCTGGCTTT CCAACGCCCG GATCACtSc 
TATCAGGCCT TGCTTTTGGA CACGGACCGG GTCCAGTTCG GACCGGTGGT AGCCCTGAAr 
CCGGCTACGC TGCTCCCACT GCCTGAGGAA GGGCTGCAAC ACAACTGCCT TGATATCCTG 
^^2 AAGCCC ACGGAACCCG ACCCGACCTA ACGGACCAGC CGCTCCCAGA CGCCG^Sc 

rSS25™ cggatggaag cagtctctta caagagggac agcgtaaggc ggStct^g 
gtgaccaccg agaccgaggt aatctgggct aaagccctgc cagccgggac atccg-tcag 
cgggctgaac tgatagcact cacccaggcc ctaaagatgg cagaagctaa gI^^™ 
g ™*Z^tg atagccgtta tgcttttgct actgcccata tccatggISa 
a 5?^tgggt tgctcacatc agaaggcaaa gagatcaaaa ataaagacga gatcttggEc 

CTACTAAAAG CCCTCTTTCT GCCCAAAAGA CTTAGCATAA TCCATTGTCC AGGACATCAA 
^^S™^ GCGCCGAGGC TAGAGGCAAC CGGATGGCTG AC CAAGCGG C CCGAA^^*A 
GCCATCACAG AGACTCCAGA CACCTCTACC CT C CT CAT AG AAAATTCATC ACCCTACACC 
^™^T T TTCATTAC AC AGTGAC TGAT ATAAAGGACC TAACCAAGTT GGgSS£S£S 

CAAAGAAGTA TTGGGTCTAC CAAGGAAAAC CTGTGATGCC TGAcSgTTT 
ft^lTP 0 ^ TATTAGACTT TCTTCATCAG CTGACTCACC TCAGCTTCTC AAAAATGAAG 

AGAGAAGCCA CAGTCCCTAC TACATGCTGA ACCGGGATCG AACACTCAAA 
AATATCACTG AGACCTGCAA AGCTTGTGCA CAAGTCAACG CCAGCAAGTC TGC C GTT AAA 
CAGGGAACTA GGGTC CGCGG GCATCGGCCC GGCACTCATT GGGAGATCGA TTTCACCGAG 
ATAAAGCCCG GATTGTATGG CTATAAATAT CTTCTAGTTT TTATAGATAC CTTTTCTGGC 
TGGATAGAAG CCTTCCCAAC CAAGAAAGAA ACCGCCAAGG TCGTAACCAA GAAGCTACTA 
TTCGTCTCCA lc^?Zrl GGGCATGCCT C AGGT ATTGG GAACTGACAA TGGGCCTGCC 6120 
TCCA AGGTGAGTCA GACAGTGGCC GATCTGTTGG GGATTGATTG GAAATTACAT 61R0 
TS^ ATACA ^CCCCAAAG CTCAGGCCAG GTAGAAAGAA TGAATAGAAC CATCAAGGAG S240 
ACTTTAACTA AATTAACGCT TG CAACTGGC TCTAGAGACT GGGTGCTCCT ACTCCcEtTA S3 0D 

£E=S££r rr^rZ^ CACGGGGGGG CCCCATGGCC TCACCCCAtI Scm 60 
T A ;555S CAC CCCCGCCCCT TGTAAACTTC CCTGACCCTG ACATGACAAG AGTTACTAAC 64^0 
A ^CCTCTC TCCAAGC TCA CTTACAGGCT CTCTACTTAG TCCAGCACGA AGT CTGG AG A 6480 
GTCGGCGAPA r^23 AGGA AGAAC AAC TG GACCGACCGG TGGTACCTCA CCCT^CGA stto 
5I G ??SS A 2 A CAGTGTGGGT CCGCCGACAC CAGACTAAGA ACCTAGAACC TCG CTGG AAA 
GGACCTTACA CAGTCCTGCT GACCACCCCC ACCGCCCTCA AAGTAGACGG CAT C GCAGCT 
? G ^J AGAGG CCGCCCACGT GAAGGCTGCC GACCCCGGGG GTGGACCATC CTCTAGACTG 
ACATGGCGCG TTCAACGCTC TCAAAACCCC TTAAAAATAA GGTTAACCCG CGAGGCCCCC 
TAATCCCCTT AATTCTTCTG ATGCTCAGAG GGGTCAGTAC TGCTTCGCCC GGCTCCAGTG 
CGGCCCAGCC GGCCACCATG AAAACATTTA ACATTTCTCA ACAAGATCTA GAATTAGTAG 
^?^ GCGAC AGAGAAGATT ACAATGCTTT ATGAGGATAA T AAACAT CAT GTGGGAG CGG 
CAATTCGTAC GAAAACAGGA GAAATCATTT CGGCAGTACA TATTGAAGCG TATATAGGAC 70^0 
GA S^ ACTGT TTG TGCAGAA GCCATTGCGA TTGGTAGTGC AGTTTCGAAT GGACAAAAGG 70^0 
A TTTT GAGAC GATTGTAGCT GTTAGACACC CTTATTCTGA C GAAGT AGAT AG AAGT ATT C 7140 
GAG I5^ TAAG TCCTTGTGGT ATGTGTAGGG AGTT GATTTC AGACTATGCA CCAGATTGTT 720 0 

TTGTGTTAAT AGAAATGAAT GGCAAGTTAG TCAAAACTAC GATTGAAGAA CTCATTCCAC 72 50 

TCAAATATAC CCGAAATTAA AAGTTTTACC ACCAAGCTTA TCGAATTC 73 0 3 
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Figure 8. hCMV+intronknSD Sequence 1 
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AGATCTCCCG 
AGCCAGTATC 
TAAGCTACAA 
CGTTTTGCGC 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGACT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 



CGGTGGGAGG 
GCTTATCGAA 
CTTTTTCGCT 
TAGAATGGGA 
CTTTCTACTC 
TTCGTTAAAC 
AGATTGTAAG 
TACTTCAGCA 
GCATATAAAT 
CATCATCCTG 
ATACTCTGAG 
ACAGCTCCTG 
GCAAGCTTCT 
GAGAATATGG 
GTCGAGCGGA 
TGCTCTGCAG 
GACCTCATCA 
CAGGTCCCCT 
CCCTTTGTAC 
GAACCTCCTC 
GGCGCCAAAC 
ACAGAAGACC 
GGTGGAGAAG 
CGTGGGAGAC 
GCAGGAGGAA 
AAAAATAATA 
GTTCTCATCA 
ACCGGAGAAG 
GGGCGCCCCA 
TGGGATTACA 
GCGGGTCTCC 
CAAGGGCCCA 
TACACTCCTT 
TGGCAGTCTG 
ACGCTTGGAG 
GAAAGAGAGG 
GATGAGCAGA 
GCCACTGTCG 
CTCGATCGCG 
AAGAAACCAC 
TAGGGAGGTC 
CAACCCGTCA 
GGACCCCTAA 
TGGACCACGG 
GTAC CAG ACT 
ATCCACTTTG 
TTGACCCTAA 
TCTCTAGGGT 
GGACTGGCAG 
TCCATAAAAC 
AGACTGTTGG 
CCCGTTAAGA 
AAGCGGGTGG 
CCACCGTCCC 
CTCCACCCCA 



ATCCCCTATG 
TGCTCCCTGC 
CAAGGCAAGG 
TGCTTCGCGA 
AGTAATCAAT 
TT AC GG T AAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
ATGTCGACTG 
ATTGTAAAAT 
AGATGTCCCT 
TGTTGACAAC 
TTTAGCTTGC 
TACTTTCTCT 
CAGTTTTAGA 
TCTGGCTGGC 
CCTTTCTCTT 
TCCAAACCGG 
GGCAACGTGC 
GCAGCATCGT 
GCCAGACTGT 
TCGCTCACAA 
AATGGCCAAC 
CCCAGGTTAA 
ACATCGTGAC 
ACCCTAAGCC 
GTTCGACCCC 
CTAAACCTCA 
CCCCGCCTTA 
CGACCCCTGC 
GGGAGCCCCC 
ACGGACAGCT 
ACCCTTCTTT 
CCCATCAGCC 
AAAAACAACG 
CTCAACTGCC 
CCACCCAGGC 
AAAACGCGGG 
ATGAGTCTCC 
ATGACCCTGA 
CCCCAGACAT 
ATTTGGTTAG 
AACGTATCAG 
AAGAGAAAGA 
TTAGTGGACA 
ACCAGTGTGC 
GAGGACCTCG 
AGGGTCAGGA 
CCTTCCTGGT 
GTGATAAGTC 
ATCGCAAAGT 
GTCCCTATCC 
AGGGATCAGG 
ATATAGAAGA 
CCACATGGCT 
TTCGCCAAGC 
AATACCCCAT 
ACCAGGGAAT 
AACCAGGGAC 
AAGACATCCA 
ACCAGTGGTA 
CCAGTCAGCC 



GTCGACTCTC 
TTGTGTGTTG 
CTTGACCGAC 
TGTACGGGCC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCTC 
AGAAC TTC AG 
TCATGTTATA 
TGTATCACCA 
CATTGTCTCC 
ATTTGTAACG 
AATCACTTTT 
GAACAATTGT 
GTGGAAATAT 
TATGGTTACA 
GCCCCTCTGC 
TGGTTGTTGT 
TCTGTGTTGT 
TACCACTCCC 
CCAGTCGGTA 
CTTTAACGTC 
GATCAAGGTC 
CTGGGAAGCC 
TCCGCCTCCT 
GCCTCGATCC 
AGTTCTTTCT 
TAGGGACCCA 
GGGAGAGGCA 
TGTGGCCGAC 
TCAATACTGG 
TTCTGAAGAT 
GACCTGGGAC 
GGTGCTCTTA 
CAATGAAGTC 
AGGACGCAAC 
CAGAAGCCCC 
CTCGGCCTTC 
GGACCCAGGG 
TGGGAGAAAG 
AGAGGCAGAA 
GAGAGAAACA 
AAGAGATCGT 
GAAACAGGAT 
CTACTGCAAA 
GGGACCAAGA 
GCCCCCCCCT 
AGATACTGGG 
TGCCTGGGTC 
ACATCTAGCT 
TCTGTTAGGA 
AGCTCAGGTT 
TGAGCATCGG 
C-TCTGATTTT 
TCCTCTGATC 
GTCACAAGAA 
ACTGGTACCC 
TAATGATTAT 
CCCCACCGTG 
CACTGTGCTT 
TCTCTTCGCC 



AGTACAATCT 
GAGGTCGCTG 
AATTGCATGA 
AGATATACGC 
TTAGTTCATA 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TGGCTAACTA 
GGTGAGTTTG 
TGGAGGGGGC 
TGGACCCTCA 
TCTTATTTTC 
AATTTTTAAA 
TTTTCAAGGC 
TATAATTAAA 
TCTTATTGGT 
ATGATATACA 
TAACCATGTT 
GCTGTCTCAT 
CTCTGTCTGA 
TTAAGTTTGA 
GATGTCAAGA 
GGATGGCCGC 
TTTTCACCTG 
TTGGCTTTTG 
CTTCCTCCAT 
TCCCTTTATC 
GACAGTGGGG 
AGACCACCCC 
CCGGACCCCT 
TCCACTACCT 
CCGTTCTCCT 
C C AGGTAAAC 
GACTGTCAGC 
GAGGCTAGAA 
GATGCCGCTT 
CACCTAGTCC 
AC CAATTTGG 
CTAGAGAGAC 
CAAGAAACTA 
TTAGAGAGGT 
AAGATCTTTA 
GAGGAAAAAG 
AGGAGACATA 
AGACAGGGAG 
GAAAAGGGGC 
CCCCAGACCT 
GAACCCAGGA 
GCCCAACACT 
CAAGGGGCTA 
ACCGGTAAGG 
AGAGATTTGC 
ATGGGACCAA 
CTACATGAGA 
CCTCAGGCCT 
ATACCTCTGA 
GCCAGACTGG 
TGCCAGTCCC 
AGGCCTGTCC 
CCCAACCCTT 
GATTTAAAGG 
TTTGAGTGGA 



GCTCTGATGC 
AGTAGTGCGC 
AGAATCTGCT 
GTTGACATTG 
GCCCATATAT 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 



CGCATAGTTA 
GAGCAAAATT 
TAGGGTTAGG 
ATTATTGACT 
GGAGTTCCGC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 



AAATGGGCGG 
GAGAACCCAC 
GGGACCCTTG 
AAAGTTTTCA 
TGATAATTTT 
TTTTCATTTT 
TTCACTTTTG 
AATCAGGGTA 
TGATAAGGTA 
AGAAACAACT 
CTGTTTGAGA 
CATGCCTTCT 
CATTTTGGCA 
CTGTGTTTCT 
CCTTAGGTCA 
AGAGACGTTG 
GAGACGGCAC 
GCCCGCATGG 
ACCCCCCTCC 
CCGCCCCGTC 
CAGCCCTCAC 
GGCCGCTCAT 
CTTCCGACAG 
CCCCAATGGC 
CGCAGGCATT 
CTTCTGACCT 
TGACAGCTCT 
AGCTGTTGGG 
AGGCGGTGCG 
TTCCCCTCGA 
ACTATCGCCA 
C CAAGGT AAA 
TTAAGGAAGC 
ATGTGTCTAT 
TAGAAGATTT 
ATAAACGAGA 
AAGAACGCCG 
GAGAGATGAG 
GAGAACGAAG 
ACTGGGCTAA 
CCCTCCTGAC 
TAACCCTCAA 
CCGTGCTGAC 
CTGGAGGAAA 
TCACCCACTC 
TGACTAAACT 
TGGGGCAGCC 
CCTCAAAAGA 
GGGCGGAAAC 
AAGCAACCTC 
GGATCAAGCC 
CCTGGAACAC 
AGGATCTGAG 
ACAACCTCTT 
ATGCCTTTTT 
GAGATCCAGA 



TAGGCGTGTA 
TGCTTAACTG 
ATTGTTCTTT 
GGGTGTTGTT 
GTTTCTTTCA 
CTGTAACTTT 
TTTATTTGTC 
TATTATATTG 
GAATATTTCT 
ACATCCTGGT 
TGAGGATAAA 
TCTTTTTCCT 
AGAATTGGCC 
GTATTTGTCT 
CTGGAAAGAT 
GGTTACCTTC 
CTTTAACCGA 
ACACCCAGAC 
CTGGGTCAAG 
TCTCCCCCTT 
TCCTTCTCTA 
CGACCTACTT 
GGACGGAAAT 
ATCTCGCCTA 
CCCCCTCCGC 
TTACAACTGG 
GATCGAGTCT 
GACTCTGCTG 
GGGCGATGAT 
GCGCCCAGAC 
GTTGCTCCTA 
AGGAATAACA 
CTATCGCAGG 
GTCTTTCATT 
AAAAAACAAG 
AACCCCGGAA 
TAGGACAGAG 
CAAGCTATTG 
GAGGTCCCAA 
AGATTGTCCC 
CCT AGATG AC 
AGTCGGGGGG 
CCAAAATCCT 
GCGGTATCGC 
TTTCCTCCAT 
AAAAGCCCAA 
CCTGCAAGTG 
GCCAGATGTT 
CGGGGGCATG 
TACCCCCGTG 
CCACATACAG 
GCCCCTGCTA 
AGAAGTCAAC 
GAGCGGGCTC 
CTGCCTGAGA 
GATGGGAATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 

* A. V 

780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
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TCAGGACAAT TGACCTGGAC CAGACTCCCA CAGGGTTTCA AAAACAGTCC C^CCC^GTT^ 

GATGAGGCAC TGCACAGAGA CCTAGCAGAC TTCCGGATCC AGCAC C CAG A CTTGATCCTG 4? no 

CTAGAGTACG TGGATGACTT ACTGCTGGCC GCCACTTCTG AGCTAGACTG CC AACAAGGT lUn 

AGTCGGGCCC TGTTACAAAC CCTAGGGAAC CTCGGGTATC GGGCCTCGGC C AAGAAAGC C A?n 

CAAATTTGCC AGAAACAGGT CAAGTATCTG C-GGTATCTTC T AAAAGAGG G TC^GAGATGG A*o 

CTG AC TGAGG CCAGAAAAGA GACTGTGATG GGGCAGCCTA CTCCGAAGAC CCCTCGACAA 11 An 

CTAAGGGAGT TCCTAGGGAC GGCAGGCTTC TGTCGCCTCT GGATCCCTGG GTTTGCAGAA 2Snn 

ATGGCAGCCC CCTTGTACCC TCTCACCAAA ACGGGG AC T C TGTTTAATTG GGGCCCAGAC dlzn 
CAACAAAAGG CCTATCAAGA AATCAAGCAA GCTCTTCTAA CTGCCCCAGC CCTGGGG^TG" 
CCAGATTTGA CTAAGCCCTT TGAACTCTTT GTCGACGAGA AGCAGGGCT \ CGCCAAA GGT 
GTCCTAACGC AAAAACTGGG ACCTTGGCGT CGGCCGGTGG CCTACCTGTC C AAAAAGC T A 
GACCCAGTAG CAGCTGGGTG GCCCCCTTGC CTACGGATGG TAGCAGCCA^ TGCCGT\CTG 
ACAAAGGATG CAGGCAAGCT AACCATGGGA CAGCCACTAG TCATTCTGGC CCCCCATGCA 
GTAGAGGCAC TAGTCAAACA ACCCCCCGAC CGCTGGCTTT CCAACGCCCG GATGACTCAC 
TATCAGGCCT TGCTTTTGGA CACGGAC CGG GTCCAGTTCG GACCGGTGGT AGCCCTGAAC 
CCGGCTACGC TGCTCCCACT GCCTGAGGAA GGGCTGCAAC ACAACTGCCT TGATATCCTG 
GCCGAAGCCC ACGGAAC CCG ACCCGACCTA ACGGACCAGC CGCTCCCAGA CGCCGACCAC 
ACCTGGTACA CGGATGGAAG CAGTCTCTTA CAAGAGGGAC AGCGTAAGGC GGGAGCTGCG 

GTGACCACCG AGACCGAGGT AAT CTGGGCT AAAGCCCTGC CAGCCGGGAC ATCCGCTCAG S7?n 

CGGGCTGAAC TGATAGCACT CACCCAGGCC CTAAAGATGG CAGAAGGTAA GAAGCTAAAT 52 8 0 

GTTTATACTG ATAGCCGTTA TGCTTTTGCT ACTGC C CAT A TCCATGGAGA AATATACAGA 5340 

AGGCGTGGGT TGCTCACATC AGAAGGCAAA GAGATCAAAA ATAAAGACGA GATCTTGGCC 5400 

CTACTAAAAG CCCTCTTTCT GCCCAAAAGA CTTAGCATAA TCCATTGTCC AGGACATCAA 5460 

AAGGGACACA GCGCCGAGGC TAGAGGCAAC CGGATGGCTG ACCAAGCGGC CCGAAAGGCA 5 52 0 

GC CATC AC AG AGACTCCAGA CACCTCTACC CTCCTCATAG AAAATTCATC ACCCTACACC 5SA0 

TCAGAACATT TTCATTACAC AGTGACTGAT AT AAAGGAC C TAACCAAGTT GGGGGCCATT 5 640 

TATGATAAAA CAAAGAAGTA TTGGGTCTAC CAAGGAAAAC CTG TG AT GC C TGACCAGTTT 57 00 

AC TTTTG AAT TATTAGACTT TCTTCATCAG CTGACTCACC TCAGCTTCTC AAAAATGAAG 57 6 0 

GCTCTCCTAG AG AG AAG CCA CAGTCCCTAC TACATGCTGA AC C GGGATCG AACAC^CAAA 5 82 0 

AATATCACTG AGACCTGCAA AGCTTGTGCA CAAGTCAACG CCAGCAAGTC TGC CGTTAAA 5 880 

CAGGGAACTA GGGTC CGCGG GCATCGGCCC GGCACTCATT GGGAGATCGA TTTCACCGAG 594 0 

ATAAAGCCCG GATTGTATGG CTATAAATAT CTTCTAGTTT TT AT AG AT AC CTTTTCTGGC 6000 

TGGATAGAAG CCTTCCCAAC CAAGAAAGAA AC CGC CAAGG TCGTAACCAA GAAGCTACTA 6060 

GAGGAGATCT TCCCCAGGTT CGGCATGCCT CAGGT ATTGG GAACTGACAA TGGGCCTGCC 612 0 

TTCGTCTCCA AGGTGAGTCA GACAGTGGCC GATCTGTTGG GGATTGATTG G AAATT AC AT 618 0 

TGTGCATACA GACCCCAAAG CTCAGGCCAG GTAGAAAGAA TGAATAGAAC CATCAAGGAG 624 0 

ACTTTAACTA AATTAACGCT T GCAACTG GC TCTAGAGACT GGGTGCTCCT ACTCCCCTTA 63 0 0 

GCCCTGTACC GAGCCCGCAA CACGCCGGGC CCCCATGGCC TCACCCCATA TG AG AT CTT A 6 3 60 

TATGGGGCAC CCCCGCCCCT TGTAAACTTC CCTGACCCTG ACATGACAAG AGTTAC^AAC 6 42 0 

AGCCCCTCTC TCCAAGCTCA CTTACAGGCT CTCTACTTAG T C C AGCAC GA AGTCTGGAGA 6480 

CCTCTGGCGG CAGCCTACCA AGAACAACTG GACCGACCGG TGGTACCTCA CCCTT^CCGA 6 54 0 

GTCGGCGACA CAGTGTGGGT CCGCCGACAC CAGACTAAGA ACCTAGAACC TCGCTGGAAA 6 6 00 

GG AC C T T AC A CAGTCCTGCT GACCACCCCC ACCGCCCTCA AAGTAGACGG CATC G CAG CT 6 65 0 

TGGATACACG CCGCCCACGT GAAGGCTGCC GACCCCGGGG G TGG AC CATC C T CT AG AC T G 672 0 

ACATGGCGCG TTCAACGCTC TCAAAACCCC TTAAAAATAA GGTTAACCCG CGAGGCCCCC 6780 

TAATCCCCTT AATTCTTCTG ATGCTCAGAG GGGTCAGTAC TGCTTCGCCC GGCTCCAGTG 6 84 0 

CGGCCCAGCC GGCCACCATG AAAACATTTA ACATTTCTCA ACAAGATCTA GAATTAGTAG 6 90 0 

AAGTAGCGAC AGAGAAGATT ACAATGCTTT ATGAGGATAA TAAACATCAT GTGGGAGCGG 6 9 60 

CAATTCGTAC GAAAACAGGA GAAATCATTT CGGCAGTACA TATTGAAGCG . TAT AT \GGAC 702 0 

GAGTAACTGT TT G T GC AG AA GCCATTGCGA TTGGTAGTGC AGTTT CGAAT GGACAAAAGG 708 0 

AT TT T G AC AC GATTGTAGCT GTTAGACACC CTTATTCTGA CGAAGTAGAT AGAAGTA T TC 714 0 

GAGTGGTAAG TCCTTGTGGT ATGTGTAGGG AGTTGATTTC AGACTATGCA C CAG ATT G TT 720 0 

TTGTGTTAAT AG AAA TG AAT GGCAAGTTAG T CAAAAC TAG GATTGAAGAA CTCATTCCAC 72 6 0 

TCAAATATAC CCGAAATTAA AAGTTTTACC ACCAAGCTTA TCGAATTC 73 0 8 
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60 



CATATGCGGT GTGAAATACC GCACAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT ou 
TCGCCATTCA GGCTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTJTTA * 120 

CGCCAGCTGG CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GC C AGGGTTT 1 flft 

TCCCAGTCAC GACGTTGTAA AACGACGGCC AGTGAATTCC GATTAGTTCA ATTTGTTAAA 0 

GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AAT AC CAC CA GCTAAAACCA ton 

CTAGAATACG AGC CAC AAT A AATAAAAGAT TTTATTTAGT TTCCAGAAAA AGGGGGGAAT 3 60 

GAAAGACCCC ACCAAATTGC TTAGCCTGAT AGCCGCAGTA ACGCCATTTT GCAAGGCATG 42 0 

GAAAAATACC AAA CC AAGAA TAGAGAAGTT CAGATCAAGG GCGGGTACAC GAAAACAGCT 4fl0 

AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG GCCCCGGCCC GGGGCCAAGA 54 0 

ACAGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 60 0 
GGCCCAACCC TCAGCAGTTT CTTAAGACCC AT C AGATGTT TCCAGGCTCC CCCAAGGACC " 66 0 
TGAAATGACC CTGTGCCTTA TTTGAATTAA CCAATCAGCC TGCTTCT CG C TTrTTTTrrir 
:^«AG^ 4 CiATAAAAGA (jCXUACAACC CCTCACTCGG CGCGCCAGTC 



CTC CG AT AG A CTGAGTCGCC CGGGTACCCG TGTATCCAAT AAATCCTCTT 1 GCTGTTGCAT 8 40 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCTCA GAGTGATTGA CTACCCGTCT 



780 
840 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCCCA GGGACCACCG 9 60 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCTTATATGG GGCACCCCCG CCCCTTGTAA 102 0 

ACTTCCCTGA CCCTGACATG ACCAGAGTTA CTAACAGCCC CTCTCTCCAA GCTCACTTAC lOflO 

AGGCTCTCTA CTTAGTCCAG CACGAAGTTT GG AG AC CAC T GGCGGCAGCT TACCAAGAAC 114 0 

AACTGGACCG GCCGGTGGTG CCTCACCCTT ACCGGGTCGG C G AC ACAGTG TGGGTCCGCC 1200 

GACATCAAAC CAAGAACCTA GAACCTCGCT GGAAAGGACC TTACACAGTC CTGCTGACCA 1 

CCCCCACCGC CCTCAAAGTA GACGGTATCG CAGCTTGGAT ACACGCAGCC CACGTAAAGG 13 2 0 

CGGCCGACAC CGAGAGTGGA CCATCCTCTG GACGGACATG GCGCGTTCAA CGCTCTCAAA 13 8 0 

ACCCCCTCAA GATAAGATTA ACCCGTGGAA GCCCTTAATA GTCATGGGAG TCCTGTTAGG 144 0 

AGTAGGGATG GCAGAGAGCC CCCATCAGGT CTTTAATGTA ACCTGGAGAG TCACCAACCT 15 00 

GATGACTGGG CGTACCGCCA ATGCCACCTC CCTCCTGGGA ACTGTACAAG ATGCCTTCCC 1560 

AAAATTATAT TTTGATCTAT GTGATCTGGT C GG AG AGG AG TGGGACCCTT CAGACCAGGA 162 0 

ACCGTATGTC GGGTATGGCT GCAAGTACCC CGCAGGGAGA CAGCGGACCC GGACTTTTGA 168 0 

CTTTTACGTG TGCCCTGGGC ATAC CGT AAA GTCGGGGTGT GGGGGACCAG GAGAGGGCTA 1740 

CTGTGGTAAA TGGGGGTGTG AAACCACCGG ACAGGCTTAC TGGAAGCCCA CATCATCGTG 1800 

GGACCTAATC TCCCTTAAGC GCGGTAACAC CCCCTGGGAC ACGGGATGCT CTAAAGTTGC 18 60 

CTGTGGCCCC TGCTACGACC TCTCCAAAGT ATCCAATTCC TTCCAAGGGG CTACTCGAGG 192 0 

GGGCAGATGC AACCCTCTAG TCCTAGAATT CACTGATGCA GGAAAAAAGG CTAACTGGGA 19 80 

CGGGCCCAAA TCGTGGGGAC TGAGACTGTA CCGGACAGGA ACAGATCCTA TTACCATGTT 2 0 40 

CTCCCTGACC CGGCAGGTCC TTAATGTGGG ACCCCGAGTC CCCATAGGGC CCAACCCAGT 2100 

ATTACCCGAC CAAAGACTCC CTTCCTCACC AATAGAGATT GTACCGGCTC CACAGCCACC 2160 

TAGCCCCCTC AATACCAGTT ACCCCCCTTC CACTACCAGT ACACCCTCAA CCTCCCCTAC 222 0 

AAGTCCAAGT GTCCCACAGC CACCCCCAGG AACTGGAGAT AGACTACTAG CTCTAGTCAA 22 8 0 

AGGAGCCTAT CAGGCGCTTA ACCTCACCAA TCCCGACAAG ACCCAAGAAT GTTGGCTGTG 23 4 0 

CTTAGTGTCG GGACCTCCTT ATTACGAAGG AGTAGCGGTC GTGGGCACTT ATACCAATCA 2 40 0 

TTCCACCGCT CCGGCCAACT GTACGGCCAC TTCCCAACAT AAGCTTACCC TATCTGAAGT 2 4 60 

GACAGGACAG GGCCTATGCA TGGGGGCAGT ACCTAAAACT CACCAGGCCT TATGTAACAC 2 5 20 

CACCCAAAGC GCCGGCTCAG GATCCTACTA CCTTGCAGCA CCCGCCGCAA CAATGTGGGC 2 5 80 

TT GC AGC ACT GGATTGAC T C CCTGCTTGTC CACCACGGTG CTCAATCTAA C CAC AGATT A 2 6 40 

TTGTGTATTA GTTGAACTCT GGCCCAGAGT AATTT AC CAC TCCCCCGATT ATATGTATGG 27 00 

TCAGCTTGAA CAGCGTACCA AATATAAAAG AGAGC C AGT A TCATTGACCC TGGCCCTTCT 27 60 

ACTAGGAGGA TTAACCATGG GAGGGATTGC AG C TGGAAT A GGGACGGGGA CCACTGCCTT 2 82 0 

AATTAAAACC CAGCAGTTTG AGCAGC TTC A TGCCGCTATC C AGAC AG AC C TCAACGAAGT 2 8 80 

CGAAAAGTCA ATT AC CAAC C TAGAAAAGTC ACTGACCTCG TTGTCTGAAG TAGTCCTACA 2 9 40 

GAACCGCAGA GGCCTAGATT TGCTATTCCT AAAGGAGGGA GGTCTCTGCG CAGC CCT AAA 3 000 

AGAAGAATGT TGTTTTTATG CAGACCACAC GGGGCTAGTG AGAGACAGCA TGGCCAAATT 3 0 60 

AAGAGAAAGG CTTAATCAGA GACAAAAACT ATTTGAGACA GGCCAAGGAT GGTTCGAAGG 312 0 

GCTGTTTAAT AGATCCCCCT GGTTTACCAC CTTAATCTCC AC CAT C ATGG GACCTCTAAT 3180 

AGTACTCTTA CTGATCTTAC TCTTTGGACC TTGCATTCTC AATCGATTAG TTCAATTTGT 3 2 40 

TAAAGACAGG ATCTCAGTAG TCCAGGCTTT AGTCCTGACT C AAC AAT AC C ACCAGCTAAA 3 3 00 

GC CTATAGAG TACGAGCCAT AGGGCGCCTA GTGTTGACAA TTAATCATCG GCATAGTATA 3 3 60 

CGGCATAGTA TAATACGACT CACTATAGGA GGGCCACCAT GGCCAAGTTG ACCAGTGCCG 3 42 0 

TTCCGGTGCT CACCGCGCGC GACGTCGCCG GAGCGGTCGA G TTC T GG AC C GACCGGCTCG 3 4 30 

GGTTCTCCCG GGACTTCGTG GAGG ACGAC T TCGCCGGTGT GGTCCGGGAC GACGTGACCC 3 5 40 

TGTTCATCAG C GC GG TC C AG G AC C AGGTGG TGCCGGACAA CACCCTGGCC TGGGTGTGGG 3 60 0 

TGCGC GGCCT GG AC G AGC T G TACGCCGAGT GG TC GG AGG T CGTG7CCACG AACTTCCGGG 3 6 60 

ACGCCTCCGG GCCGGCCATG ACCGAGAT CG GCGAGCAGCC GTGGGGGCGG GAGTTCGCCC 3 72 0 

TGCGCGACCC GGCCGGCAAC TGCGTGCACT TCGTGGCCGA GGAGC AGG A C TGANNNNCGG 3 7 80 

ACCGGTCGAC TTGTTAACTT GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC 3 8 40 

ATCACAAATT TCACAAATAA AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA 3 900 

CTCATCAATG TATCTTATCA TGTCTGGATC CAGATCTGGG CCCATGCGGC CGCGGATCGA 3 9 60 

TNNNNACATG TGAGCAAAAG G C C AG CAAAA GGCCAGG AAC CGTAAAAAGG CCGCGTTGCT 402 0 

GGCGTTTTTC CATAGGCTCC GCCCCCCTGA CGAGCATCAC AAAAATCGAC GCTCAAGTCA 4030 
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GAGGTGGCC-A 

CGTGCGCTCT 

GGG AAGCGTG 

TCGCTCCAAG 

CGGTAACTAT 

CACTGGTAAC 

GTGGCCTAAC 

AGTTACCTTC 

CGGTGGTTTT 

TCCTTTGATC 

TTTGGTCATG 

TTTTAAATCA 

CAGTGAGGCA 

CGTCGTGTAG 

ACCGCGAGAC 

GGCCGAGCGC 

CCGGGAAGCT 

TACAGGCATC 

ACGATCAAGG 

TCCTCCGATC 

ACTGCATAAT 

CTCAACCAAG 

AATACGGGAT 

TTCTT CGGGG 

CACTCGTGCA 

AAAAACAGGA 

ACTCATACTC 

CGGATACATA 

CCGAAAAGTG 

TAGGCGTATC 

ACACATGCAG 

AGCCCGTCAG 

ATCAGAGCAG 



AACCCGACAG 

CCTGTTCCGA 

GCGCTTTCTC 

CTGGGCTGTG 

CGTCTTGAGT 

AGGATTAGCA 

TACGGCTACA 

GG AAAAAGAG 

TTTGTTTGCA 

TTTTCTACGG 

AGATTATCAA 

ATCTAAAGTA 

CCTATCTCAG 

ATAACTACGA 

CCACGCTCAC 

AGAAGTGGTC 

AGAGTAAGTA 

GTGGTGTCAC 

CGAGTTACAT 

GTTGTCAGAA 

TCTCTTACTG 

TCATTCTGAG 

AATACCGCGC 

CGAAAACTCT 

CCCAACTGAT 

AGGCAAAATG 

TTCCTTTTTC 

TTTGAATGTA 

CCACCTGACG 

ACGAGGCCCT 

CTCCCGGAGA 

GGCGCGTCAG 

ATTGTACTGA 



GACTATAAAG 

CCCTGCCGCT 

AATGCTCACG 

TGCACGAACC 

CCAACCCGGT 

GAGCGAGGTA 

CTAGAAGGAC 

TTGGTAGCTC 

AGCAGCAGAT 

GGTCTGACGC 

AAAGGATCTT 

TATATGAGTA 

CGATCTGTCT 

TACGGGAGGG 

CGGCTC CAGA 

CTGCAACTTT 

GTTCGCCAGT 

GCTCGTCGTT 

GATCCCCCAT 

GTAAGTTGGC 

TCATGCCATC 

AATAGTGTAT 

CACATAGCAG 

CAAGGATCTT 

CTTCAGCATC 

CCGCAAAAAA 

AATATTATTG 

TTTAGAAAAA 

TCTAAGAAAC 

TTCGTCTCGC 

CGGTCACAGC 

CGGGTGTTGG 

GAGTGCAC 



ATACCAGGCG 

TACCGGATAC 

C TGT AGGTAT 

CCCCGTTCAG 

AAGACACGAC 

TGTAGGCGGT 

AGTATTTGGT 

TTGATCCGGC 

TACGCGCAGA 

TCAGTGGAAC 

CACCTAGATC 

AACTTGGTCT 

ATTTCGTTCA 

CTTACCATCT 

TTTATCAGCA 

ATCCGCCTCC 

TAATAGTTTG 

TGGTATGGCT 

GTTGTGCAAA 

CGCAGTGTTA 

CGTAAGATGC 

GCGGCGACCG 

AACTTTAAAA 

ACCGCTGTTG 

TTTTACTTTC 

GGGAATAAGG 

AAGCATTTAT 

TAAACAAATA 

CATTATTATC 

GCGTTTCGGT 

TTGTCTGTAA 

CGGGTGTCGG 



TTTCCCCCTG 

CTGTCCGCCT 

CTCAGTTCGG 

CCCGACCGCT 

TTATCGCCAC 

GCTACAGAGT 

ATCTGCGCTC 

AAACAAAC C A 

AAAAAAGGAT 

GAAAACTCAC 

CTTTTAAATT 

GACAGTTACC 

TCCATAGTTG 

GGCCCCAGTG 

ATAAACCAGC 

ATCCAGTCTA 

CGCAACGTTG 

TCATTCAGCT 

AAAGCGGTTA 

TCACTCATGG 

TTTTCTGTGA 

AGTTGCTCTT 

GTGCTCATCA 

AGATCCAGTT 

ACCAGCGTTT 

GCGACACGGA 

CAGGGTTATT 

GGGGTTCCGC 

ATGACATTAA 

GATGACGGTG 

GCGGATGCCG 

GGCTGGCTTA 



GAAGCTCCCT 

TTCTCCCTTC 

TGTAGGTCGT 

GCGCCTTATC 

TGGCAGCAGC 

TCTTGAAGTG 

TGCTGAAGCC 

CCGCTGGTAG 

CTCAAGAAGA * 

GTTAAGGGAT 

AAAAATGAAG 

AATGCTTAAT 

CCTGACTCCC 

CTGCAATGAT 

CAGCCGGAAG 

TTAATTGTTG 

TTGCCATTGC 

CCGGTTCCCA 

GCTCCTTCGG 

TTATGGCAGC 

CTGGTGAGTA 

GCCCGGCGTC 

TTGGAAAACG 

CGATGTAACC 

CTGGGTGAGC 

AATGTTGAAT 

GTCTCATGAG 

GCACATTTCC 

CCTATAAAAA 

AAAACCTCTG 

GGAGCAGACA 

ACTATGCGGC 
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Figure 10. FBdelPMOSAF Sequence l 

TCGCCA™ GGCT^r^ ^CAGATGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT S n 

SSSSS SSSSSS ;™ G T £^ C J£ £ ggg ™ a I • 12 5 o° 
£££££ g£E?£ J££iS& SJSSS SSS?2S 

SSSSS £££££ SSSSSS SSSSS o 4 § 

r^?^ agcaaa ™^ TrSSSSi SSSSS ESSiS 360 

gaaaaatacc aaaccaagaa tagagaagtt cagatcaagg Grrr^T-AaA ^«VVrS TC 420 

aacgttgggc caaacaggat atct£S££ otg^cc c^r^I- 480 

acagatggtc accgcggttc ggccccggcc cggggccaag aacaga^? c"^fr™ 540 

ggcccaaccc tcagcagttt cttaagaccc atcagatgtt tcSggc^c cc^^lrr 600 

TGAAATGACC CTGTGCCTTA TTTGAATTAA GCAATCAGCC TG^CTrr? SSS^f^^ S 6° 
GCGCTTCTGC TTCCrnAtyyr ^^"VVii^V ^^V-:^'? OC:t : TGCTTCTCGC TTCTGTTCGC 7?n 

CTCCGATAGA CTGAGTCGCC CGGGTACCCg" TGTATCCAAT AAATr£<£^ ~^ CCAGTC 7 80 

CCGACTCGTG GTCTCGCTGT TCCTTGGGAG GGTCTCCtcI GCTGTTGCAT 840 

CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA SS^SS 9 °° 

ACCCACCACC GGGAGGTAAG CTGGCCAAGA TCTTATATr? r^ar^^ GGGACCACCG 960 

ACTTCCCTGA CCCTGACATG ACAAGAGTTA CTCTC^CcS r^T^^ 1020 
AGG f I^S™ CTTAGTCCAG C AC G AAGTCT GG^lcSS SSSSSS TACCAAGAAC 

AACTGGACCG ACCGGTGGTA CCTCACCCTT KCCCAnrnr rrT^T^V^; 1C CCAAGAAC 1140 

GACACCAGAC TAAGAACCTA SSSSSS ££££££ T^ACACAGTC 12 °° 

CCCCCACCGC CCTCAAAGTA GACGGCATCG CAGCTTGGAT ACACGCCGCC 2S^££ 1260 

CTGCCGACCC CGGGGGTGGA CCATCCTCTA cirTra^n £^™£S° C CACGTGAAGG 1320 

ACCCCTTAAA AATAAGGTTA SSSSS gg£g£ ££££££ 13 8 ° 

CAGAGGGGTC AGTACTGCTT CGCCCGGCTC rzrTrr^lv r-^r-i^tZT TTCTGATGCT 1440 

GGAGGTAACC AATGGAGATC S£SS£? ATGGGCAACT S " 00 

GACCTGGTGG CCTGACCTTA CCCCAGATTT ATGTATGTTa ^rr^TS~VS2 AC CCTCTGTG 15 60 

TTGGGGGCTA GAATATCAAT CCCCTTTTTC °^? ATCTTA 162 0 
GGGCAGCAGC CCAGGCTGT? C« SSSSS ™c^C ^cIc^CTcl 

GCCTCGAACA C-ACTCAAGCT AGACCAGACA ACTCATAAAT 2Eu52S? 4 ° 
£2™^ TGCCCCG GGC CCCACCGCCC CCGAGAATCC AAGTCATGTG Sg^TC^ 

Tl^lZ? GGGGCTGTGA GACAACCGGT AGAGCTTACT G^^^c 
CTCATCATGG GATTTCATCA CAGTAAACAA CAATCTCACC TCTGArCAr*- ™£r^,:~;£ 

GGTTACTTCC JSSJgS SSjSSS™ SSSK 

^:^rTi„„ tggaccacag gacattactg gggcttacgt ttgtatgtct cccGACAAra 
^ A rr r S G 5 A tccgactcag ataccaaaat ctagSIScc gcctcc^t 

r^?^™ CCCGTTCTGG CAGACCAACA GCCACTCTCC AAGCCCAAAC CTGTTAAGTC 
SSST7ESS TC ACC AAACCAC CCAGTGGGAC TCCTCTCTCC CCTACCCAAC TTCCArCGfir 
SSS££5SS?* ^ TAGGC TGC TAAACTTAGT AGACGGAGCC TACCAAGCCC 
SSSEZS^ AAAACCCAAG AGTGCTGGTT GTGTCTAGTA GCGGGACCCC CC^CTA^ 
iSSSS™?5 GTCCTGGGTA CCTACTCCAA CCATACCTCT GCTCCAGCCA AC^^CCCT 
^^^o CAA CACAAGTTGA CCCTGTCCGA AGTGACCGGA CAGGGACTCT gSSg^AG? 
^I^S™ ACACATCA GG CCCTATGTAA TACCACCCAG ACAAGCAGTC GAG^GTCOTA 

GGCCCTA CAG GTACCATGTG GGCTTGTAGT ACCGGGCTTA C^S^AT 
"CCACCACC ATACTGAACC TTACCACTGA TTATTGTGTT CTTGTCGAAC TCTGGC^AO 

CATTCCCCCA GCTATGTTTA CGGCCTGTTT GAGAGATCCA A^^C^ 
AAGAGAACCG GTGTCGTTAA CCCTGGCCCT ATTATTGGGT GGACTAACCA TGCCOfirjir 
^AA^™ ATAGGAACA ° GGACTACTGC TCTAATGGCC ACTCAGCAAT ^cIaISg^T 
oS^^^ GTACAGGATG ATCTCAGGGA GGTTGAAAAA TCAATCTCTA ACCTAGAAAA 
TC^J^l AAGTTGTCCT ACAGAATCGA AGGGGCcSg ACT^G^A^ 

TCTAAAAGAA GGAGGGCTGT GTGCTGCTCT AAAAGAAGAA TGTTGCTTCT ATCTO-irri 

ac^t^g ?c^c^ A ^ gcatggccaa ^cagagag aSSS^S? Sa m 

ttgag TCAACTCAAG GATGGTTTGA GGG AC TGTTT AACAGATCCG CTTCTTTTar 

^™ ATA TCTACCATTA TGGGACCCCT CATTGTACTC CT^TC^ TCC^C^G^ 

TTTAGTCCTG ^T^rtH ^AGTTCAATT TGTTAAAGAC AGGATCTCAG 

c?Ig??t^? r^I^^ ACCACCAGCT AAAGCCTATA GAGTACGAGC CATAGC3GCGC 

^ A ?^S TTGA CAATTAATCA TCGGCATAGT ATACGGCATA GTATAATACG ACTCACTATA 

^^?? GCCAC CATGGCCAAG TTGAC C AGTG CCGTTCCGGT GCTCACCGCG CGgScGT^G 

CCGGAGCGGT CGAGTTCTGG ACCGACCGGC TCGGGTTCTC CCGGGACTTC GTGGArJIrr 

T^GCCG^A SSSE^ GACGACGTGA CCCTGTTCAT CAG^GG^C Sg^AgS 

TS2ISSS GGA ^caccctg gcctgggtgt gggtgcgcgg cctggacg^g ctgtacgccg 
agtggtcgga ggtcgtgtcc acgaacttcc gggacgcctc cgggccggcc atcaccgaga 
T^SS CGAGCA gccgtggggg cgggagttcg ccctgcgcga cccggccg^c ^I^gtg^ 
acttcgtggc cgaggagcag gactgannnn cggaccggtc gacttgttaa c^tS^SS 

GCAGCTTATA ATGGTTACAA ATAAAGCAAT AGCATCACAA ATTTCACAAA ^Slm 

TTTTCACTGC attctagttg tggtttgtcc aaactcatca atgtatctta tcI^c^ 

ATCCAGATCT GGGCCCATGC GGCCGCGGAT CGATNNNNAC ATG TG AGCAA I^G^CAG^ 
AAAGGCCAGG AACCGTAAAA AGGCCGCGTT GCTGGCGT^T ttccataggc tcc^cccc^ 
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Figure 10. FBdelPMOSAF Sequence 



TGACGAGCAT 

AAGATACCAG 

GCTTACCGGA 

ACGCTGTAGG 

ACCCCCCGTT 

GGTAAGACAC 

GTATGTAGGC 

GACAGTATTT 

CTCTTGATCC 

GATTACGCGC 

CGCTCAGTGG 

CTTCACCTAG 

GTAAACtTGG 

TCTATTTCGT 

GGGCTTACCA 

AGATTTATCA 

TTTATCCGCC 

AGTTAATAGT 

GTTTGGTATG 

CATGTTGTGC 

GGCCGCAGTG 

ATCCGTAAGA 

TATGCGGCGA 

CAGAACTTTA 

CTTACCGCTG 

ATCTTTTACT 

AAAGGGAATA 

TTGAAGCATT 

AAATAAACAA 

AAC C ATT AT T 

CGCGCGTTTC 

AGCTTGTCTG 

TGGCGGGTGT 

C 



CACAAAAATC 

GCGTTTCCCC 

TACCTGTCCG 

TATCTCAGTT 

CAGCCCGACC 

GACTTATC GC 

GGTGCTACAG 

GGTATCTGCG 

GG C AAA C AAA 

AGAAAAAAAG 

AACGAAAACT 

ATCCTTTTAA 

TCTGACAGTT 

TCATCCATAG 

TCTGGCCCCA 

GCAATAAACC 

TCCATCCAGT 

TTGCGCAACG 

GCTTCATTCA 

AAAAAAGCGG 

TTATCACTCA 

TGCTTTTCTG 

CCGAGTTGCT 

AAAGTGCTCA 

TTGAGATCCA 

TTCACCAGCG 

AGGGCGACAC 

TATCAGGGTT 

AT AGGGGT T C 

ATCATGACAT 

GGTGATGACG 

TAAGCGGATG 

CGGGGCTGGG 



GACGCTCAAG 

CTGGAAGCTC 

CCTTTCTCCC 

CGGTGTAGGT 

GCTGCGCCTT 

CACTGGCAGC 

AGTTCTTGAA 

CTCTGCTGAA 

CCACCGCTGG 

GATCTCAAGA 

CACGTTAAGG 

ATTAAAAATG 

ACCAATGCTT 

TTGCCTGACT 

GTGCTGCAAT 

AGCCAGCCGG 

CTATTAATTG 

TTGTTGC CAT 

GCTCCGGTTC 

TTAGCTCCTT 

TGGTTATGGC 

TGACTGGTGA 

CTTGCCCGGC 

TCATTGGAAA 

GTTCGATGTA 

TTTCTGGGTG 

GGAAATGTTG 

ATTGTCTCAT 

CGCGCACATT 

TAACCTATAA 

GTGAAAACCT 

CCGGGAGCAG 

TTAACTATGC 



17/22 

2 

TCAGAGGTGG 
CCTCGTGCGC 
TTCGGGAAGC 
CGTTCGCTCC 
ATCCGGTAAC 
AGCCACTGGT 
GTGGTGGCCT 
GCCAGTTACG 
TAGCGGTGGT 
AGATCCTTTG 
GATTTTGGTC 
AAGTTTTAAA 
AATCAGTGAG 
CCCCGTCGTG 
GATACCGCGA 
AAGGGCCGAG 
TTGCCGGGAA 
TGCTACAGGC 
CCAACGATCA 
CGGTCCTCCG 
AGCACTGCAT 
GTACTCAACC 
GTCAATACGG 
ACGTTCTTCG 
ACCCACTCGT 
AGCAAAAACA 
AATACTCATA 
GAGCG GAT AC 
TCCCCGAAAA 
AAATAGGCGT 
CTGACACATG 
ACAAGCCCGT 
GG CATC AG AG 
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CGAAACCCGA 
TCTCCTGTTC 
GTGGCGC TTT 
AAG CTGGGCT 
TATCGTCTTG 
AACAGGATTA 
AACTACGGCT 
TTCGGAAAAA 
TTTTTTGTTT 
ATCTTTTCTA 
ATGAGATTAT 
TCAAT C T AAA 
GCACCTATCT 
TAGATAACTA 
GACCCACGCT 
CGCAGAAGTG 
GCTAGAGTAA 
ATCGTGGTGT 
AGGCGAGTTA 
ATCGTTGTCA 
AATTCTCTTA 
AAGTCATTCT 
GATAATACCG 
GGGCGAAAAC 
GCACCCAACT 
GGAAGGC AAA 
CTCTTCCTTT 
ATATTTGAAT 
GTGCCACCTG 
ATCACGAGGC 
CAGCTCCCGG 
CAGGGCGCGT 
CAGATTGTAC 



CAGGACTATA 
CGACCCTGCC 
CTCAATGCTC 
GTGTGCACGA 
AGTCCAACCC 
GCAGAGCGAG 
ACACTAGAAG 
GAGTTGGTAG- 
GCAAGCAGCA 
CGGGGTCTGA 
CAAAAAGGAT 
GTATATATGA 
CAGCGATCTG 
CGATACGGGA 
CACCGGCTCC 
GTC CTGCAAC 
GTAGTTCGCC 
CACGCTCGTC 
CATGATCCCC 
GAAGTAAGTT 
CTGTCATGCC 
GAGAATAGTG 
CGCCACATAG 
TCTCAAGGAT 
GATCTTCAGC 
ATG C CGC AAA 
TTCAATATTA 
GTATTTAGAA 
ACGTCTAAGA 
CCTTTCGTCT 
AGACGGTCAC 
CAGCGGGTGT 
TGAGAGTGCA 
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Figure H. FBdelPGASAF Sequence 1 
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TC^S^I Sc^rr^ ^S^ TGC GTAAGGAGAA AATACCGCAT CAGGCGCCAT 
^5r C ^" TCA G^CTGCGCAA CTGTTGGGAA GGGCGATCGG TGCGGGCCTC TTCGCTATTA 
CGAAAGGGGG ATGTGCTGCA AGGCGATTAA GTTGGGTAAC GCCAcSm 
rArf^IS^ GACGTTGTAA AACGACGGCC AGTGAATTCC CArTAG^A A^G^aII 240 
GACAGGATCT CAGTAGTCCA GGCTTTAGTC CTGACTCAAC AATACCACCA GCTAAAACct Inn 
rl*%£r?£r AGCCACAATA AATAAAAGAT TTTATTTAGT TTCCAGAAaX £££££££ H° 0 

™ GACCCC accaa attgc ttagcctgat agccgcagta acgccatttt gcaaIgEatg 

GAAAAATACC AAACCAAGAA TAGAGAAGTT CAGATCAAGG GCC-GGTACAC GAAAACAG^ An 
AACGTTGGGC CAAACAGGAT ATCTGCGGTG AGCAGTTTCG CCCCCGGCCC S££SS' 
A S AGATGGTC ACCGCGGTTC GGCCCCGGCC CGGGGCCAAG AACAGATGGT CCCCAGATAT 
GGCCCAACCC TCAGCAGTTT CTTAAGACCC ATCAGATGTT TCCAGGCTCC CCCAAGGACr 

S£™c?™ H? GAATTAA ^ TCA f c tgcttctcgc ttSS???£ 
^f^TAGA ctgagtcgcc cgggtacccg tcta^ccaXt gc^tSSt Ho 

ccgactcgtg gtctcgctgt tccttgggag ggtctcctca gagtgattga CTACCCGTTT 

CG ^I CTT TCATTTGGGG GCTCGTCCGG GATCTGGAGA CCCCTGCC^ G^ACC^CG 
A ^ ACCACC GGGAGGTAAG CTGGCCAAGA TCCCTAAGGT ACTCGGGTCA GACAATGGCC 
CGGCCTTTGT TGCTCAGGTA AGTCAGGGAC TGGCCACTCA ACTGGGGATA AATTGGAAGT 
^CATTCTGC GTATAGACCC CAGAGCTCAG GTCAGGTAGA AAGAATGAAC AGAACAATTA 
^ < tf^S TT GACCAAATTA GCCTTAGAGA CCGGTGGAAA AGACTGGGTG ACCCTCCT^ 
CCTTAGCGCT GCTTAGGGCC AGGAATACCC CTGGCCGGTT TGGTTTAACT CCTTATTAaa 
"EI£E A 122 AGGACCACCC CCCATACTTG AGTCTGGAGA AACT^TGGGT CCC^T^a" 
GA rFTSTCCC TGTCTTATTT ACTCACTTAA AGGCTTTAGA AATTGTAAGG ACCCAAATOT 
£££S*5£ T CAAAGA =GTG TATAAGCCTG GTACCGTAAC AATCCCTCAC CCCTTc£££ 
1?£GG^ AGTGCTTGTC AGACGCCATC GACCCAGCAG CCTTGAGCCT CGGTGGAAAG 
5S^ ACCT GGTGTTGCTG ACTACCCCGA CCGCGGTAAA AGTCGATGGT ATTGCTGCCT 
S^CATGC TTCTCACCTC AAACCTGCAC CACCTTCGGC ACCAGATGAG TCCTgS^ 
T GGAAAAGAG TGATCATCCT CTTAAGCTG C GTATTCGGCG GCGGCGGGAC GAGTCTGCAA 
AATAAGAACC CCCACCAGCC CATGACCCTC ACTTGGCAGG TACTGTCCCA AACTGGAGAC 
GTTGTCTGGG ATACAAAGGC AGTCCAGCCC CCTTGGACTT GGTGGCCCAC ACTTAAACCT 
$ A TSZ ATGTG CCTTGGCGGC TAGTCTTGAG TCCTGGGATA TCCCGGGAAC CGATGTCTCG 

GAGTCAGACC TCCGGACTCA GAC TATACTG CCGCTTATAA GCAAATCACC 
TGGGGAGCCA TAGGGTGCAG CTACCCTCGG GCTAGGACTA GAATGGCAAG CTCTACCTTC 
^ AGG I ATGTC CCCGGGATGG CCGGACCCTT TCAGAAGCTA GAAGGTGCGG GGGGCTAGAA 
I C .£ C J ATACT GTAAAGAATG GGATTGTGAG ACCACGGGGA CCGGTTATTG GCTATCTAAA 
TCCTCAAAAG ACCTCATAAC TGTAAAATGG GACCAAAATA GCGAATGGAC TCAAAAATTT 
Ett^ZEJ? ACCAGACCGG CTGGTGTAAC CCCCTTAAAA TAGATTTCAC AGACAAAGGA 
^TI^TSS^ AGGACTGGAT AACGGGAAAA AC C TGGGGAT TAAGATTCTA TGTGTCTGGA 
^ A I? GAGG 5 G TACAGTTCAC CATTCGCTTA AAAATCACCA ACATGCCAGC TGTGGCAGTA 
GGTCCTGACC TCGTCCTTGT GGAACAAGGA CCTCCTAGAA CGTCCCTCGC TCTCCCACCT 
CCTCTTCCCC CAAGGGAAGC GCCACCGCCA TCTCTCCCCG ACTCTAACTC CACAGCCCTG 
GCGACTAGTG CACAAACTCC CACGGTGAGA AAAACAATTG TTAC CCTAAA CACTCCGCCT 
CCCACCACAG GCGACAGACT TTTTGATCTT GTGCAGGGGG CCTTCCTAAC CTTAAATGCT 
ACCAACCCAG GGGCCACTGA GTCTTGCTGG CTTTGTTTGG CCATGGGCCC CCCTTATTAT 
™™ AG CCTCATCAGG AGAGGTCGCC TACTCCACCG ACCTTGACCG GTGCCGCTGG 
^25 AGGCA ^ G ^^^S.^ C CCTCACTGAG GTCTCAGGAC ACGGGTTGTG CATAGGAAAG 
GTGCCCTTTA CCCATCAGCA TCTCTGCAAT CAGACCCTAT CCATCAATTC CTCCGGAGAC 
CATCAGTATC TGCTCCCCTC CAACCATAGC TGGTGGGCTT GCAGCACTGG CCTCACCCCT 
TGCCTCTCCA CCTCAGTTTT TAATCAGACT AGAGATTTCT GTATCCAGGT CCAGCTGATT 
CCTCGCATCT ATTACTATCC TGAAGAAGTT TTGTTACAGG CCTATGACAA TTCTCACCCC 
AGGACTAAAA GAGAGGCTGT CTCACTTACC CTAGCTGTTT TACTGGGGTT GGGAATCACG 
G £ GGGAA T AG GTACTGGTTC AACTGCCTTA ATTAAAGGAC CTATAGACCT CCAGCAAGGC 312 0 

CTGACAAGCC TCCAGATCGC CATAGATGCT GACCTCCGGG CCCTCCAAGA CTCAGTCAGC 3 130 

^TCACTGAC TTCCCTCTCC GAGGTAGTGC TCCAAAATAG gIg^CCT 3240 
GACTTGCTGT TTCTAAAAGA AGGTGGCCTC TGTGCGGCCC TAAAGGAAGA GTGCTGTTTT 3 300 

TA.CATAGACC ACTCAGGTGC AGTACGGGAC TCCATGAAAA AACTCAAAGA AAAACTGGAT 3 3 60 

AAAAGACAGT TAGAGCGCCA GAAAAGCCAA AACTGGTATG AAGGATGGT^ CAATAACTCC 3 420 

CCTTGGTTCA CTACCCTGCT ATCAACC AT C GCTGGGCCCC TATTACTCCT CCTTCTGTTG 3 480 

CTCATCCTCG GGCCATGCAT CATCAATCGA TTAGTTCAAT TTGTTAAAGA CAGGATCTCA 3 5 40 

SI AG TSX AGG CTTTA GTCCT GACTCAACAA TACCACCAGC TAAAGCCTAT AGAGTACGAG 3S00 
CCATAGGGCG CCTAGTGTTG ACAATTAATC ATCGGCATAG TATACGGCAT AGTATAATAC 3 6 60 

GACTCACTAT AGGAGGGCCA CCATGGCCAA GTTGACCAGT GCCGTTCCGG TGCTCACCGC 3720 
GCGCGACGTC GCCGGAGCGG TCGAGTTCTG GACCGACCGG CTCGGGTTCT CCCGGGACTT 3 780 

CGTGGAGGAC GACTTCGCCG GTGTGGTCCG GGACGACGTG ACCCTGTTCA TCAGCGCGGT 3 340 

CCAGGACCAG GTGGTGCCGG ACAACACCCT GGC CTGGGTG TGGGTGCGCG GCCTGGACGA 3 90C 

GCTGTACGCC GAGTGGTCGG AGGTCGTGTC CACGAACTTC CGGGACGCCT CCGGGCCGGC 3 9 60 

^ A T^ GGAG ATCGGCGAGC AGCCGTGGGG GCGGGAGTTC GCCCTGCGCG ACCCGGCCGG 4 02 0 

CAACTGCGTG CACTTCGTGG CCGAGGAGCA GGACTGANNN NCGGACCGGT CGACTTGTTA 4 080 
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Figure 11. FBdelPGASAF Sequence 2 

>S7T?":^ TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 4 u n 
ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ' 2™ 2 
ATCATGTCTG GATCCAGATC TGGGCCCATG CGGCCGCGGA TCGATNNNNA oSSSE 
AAAGCCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT TGCTGGCGTT T^TCC^TA^G 
CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCG^CCG 111?, 
ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT CCCTCGTCCG CTCTC^TCT? 
S^S^CTCC C =CTTACCGG ATACCTGTCC GCCTTTCTCC CTTCGGGAAG SSSSS ° 
i CTCAATGCT C AC GCTG TAG GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGrrr 
TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT 
GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 
AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA AGTGGTGGCC TAACT^CGGC 
TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA 
AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG GTAGCGGTGG TTTTTTTGT^ 
TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT 
ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 
?E^-^ GGA TCTTCACCTA GATCCTTTTA AATTAAAAAT GAAGTTTTAA ATCAATCTAA 
AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT TAATCAGTGA GGCACCTATC 
T^E GATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC TCCCCGTCGT GTAGATAACT 
^^552° AGGGCTTACC ATCTGGCCCC AGTGCTGCAA TGATACCGCG AGAC^C^S 
CAGATTTATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA GCGCAGAAGT 
^HSE** CTTTATCCGC CTCCATCCAG TCTATTAATT GTTGCCGGGA AGCTAGAGTA 
A ^ G I^ GC CA^AATAG TTTGCGCAAC GTTGTTGCCA TTGCTACAGG CATCGTGGTG 
TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT CCCAACGATC AAGGCGAGT" 
ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 
AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA TAATTCTCTT 
^?.^ rQC CA TCCGTAAG ATGCTTTTCT GTGACTGGTG AGTACTCAAC CAAGTCATTC 
TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG CGTCAATACG GGATAATACC 
°£™ CATA GCA GAACTTT AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA 
ST^^^A TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 
T?A^TTCAG CATCTTTTAC TTTCAC CAGC GTTTCTGGGT GAGCAAAAAC AGGAAGGCAA 
AA j. GCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT GAATACTCAT ACTCTTCCTT 
ir 10 ^^ 1, ATTGAAGCAT TTATCAGGGT TATTGTCTCA TGAGCGGATA CATATTTGAA 
TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT 60 fin 

GACGTCTAAG AAACCATTAT TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG 612 0 

? C ^ CGTC TCGCG CGTTT C GGTGATGAC GGTGAAAACC TCTGACACAT GCAGCTCCcI 6^80 
^_£GGTCA CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA GACAAGCCCG TCAGGGCGCG 6240 
CTGAGAG^GC ^ GGCGGGTG TCGGGGCTGG CTTAACTATG CGGCATCAGA GCAGATTGTA "oO 



4500 

4S50 

4620 

4630 

4740 

4800 

48S0 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

S580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 



6312 



WO 97/08330 
Figure 12. FBdelPRDSAF Sequence 



CATATGCGGT 
TCGCCATTCA 
CGCCAGCTGG 
TCCCAGTCAC 
GACAGGATCT 
CTAGAATACG 
GAAAGACCCC 
GAAAAATACC 
AACGTTGGGC 
ACAGATGGTC 
GGCCCAACCC 
TGAAATGACC 
OCGCTTCTGC 



CTCCGATAGA 
CCGACTCGTG 
CGGGGGTCTT 
ACCCACCACC 
TTTATGGGGG 
AGACTGATTT 
CCCTGGCCGA 
ACTCCGTGTA 
ACATCGTCCT 
AC GCATC GCA 
AGCTCCGCCG 
ACCTTGTCCC 
CTAATAATAG 
CAACATGGTA 
ATCCAACAGG 
TGCAGAGTCA 
ACTTTCCAGG 
AATAAGACAT 
CAGATATTAC 
CCCGTTTGCT 
ACTAAGAGAG 
GAACTTCAAT 
CGGACTTTTG 
GCCCAAGATT 
CCCTCTTTAA 
CCCCTCTTGG 
AACGATACGG 
AATGTCAGTA 
GCATACACCT 
GACATTGACA 
CATAGACCTA 
G C ATT C AC C A 
CAT CAGTT AA 
G TAG ACT CGT 
GAACAAGGAG 
GGAATTGTGA 
CTGGCAACCA 
CTGGGACCCC 
CTCATGGCCT 
T AC CAAGCAC 
ATCATCGGCA 
CAAGTTGACC 
CTGGACCGAC 
CCGGGACGAC 
CCTGGCCTGG 
GTCCACGAAC 
GGGGCGGGAG 
GCAGGAC TGA 
ACAAATAAAG 
GTTGTGGTTT 
ATGCGGCCGC 
AAAAAGGCCG 
AATCGACGCT 
CCCCCTGGAA 
TCCGCCTTTC 



GTGAAATACC 
GGCTGCGCAA 
CGAAAGGGGG 
GACGTTGTAA 
CAGTAGTCCA 
AGC CACAAT A 
ACCAAATTGC 
AAACCAAGAA 
CAAACAGGAT 
ACCGCGGTTC 
TCAGCAGTTT 
CTGTGCCTTA 



GCACAGATGC 
CTGTTGGGAA 
ATGTGCTGCA 
AACGACGGCC 
GGCTTTAGTC 
AATAAAAGAT 
TTAGCCTGAT 
TAGAGAAGTT 
ATCTGCGGTG 
GGCCCCGGCC 
CTTAAGACCC 
TTTGAATTAA 



20/22 
1 

G TAAGG AG AA 
GGGCGATC GG 
AGGCGATTAA 
AGTGAATTCC 
CTGACTCAAC 
TTTATTTAGT 
AGCCGCAGTA 
CAGATCAAGG 
AGCAGTTTCG 
CGGGGCCAAG 
ATCAGATGTT 
CCAATCAGCC 
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AATACCGCAT 
TGCGGGCCTC 
GTTGGGTAAC 
GATTAGTTCA 
AATACCACCA 
TTCCAGAAAA 
ACGCCATTTT 
GCGGGTACAC 
GCCCCGGCCC 
AACAGATGGT 
TCCAGGCTCC 
TGCTTCTCGC 



CTGAGTCGCC 
GTCTCGCTGT 
TCATTTGGGG 
GGGAGGTAAG 
ACCCCCCCCT 
ACAAGCCCGA 
ATTGTACCGG 
CGTCCGGCGG 
GCTGACCACG 
CGCCAAGGCA 
TTCGGAGAAC 
TGTACTAACC 
TTCGGGCAGG 
AACCATGCGA 
TAACTTGCCC 
CTCCAAAAAT 
ACTCGATGCA 
ACTACACGGC 
AAAACCCCAA 
GGAGTGC CAC 
TGTGGACAGT 
ACCACCCCTT 
AT ATC CTGAA 
GTTGGCTCTG 
CCTACTCCCT 
TTCAACCGAT 
AACAAATAGA 
GTCCTTTATG 
ATTTACCCCA 
TCAACCCGGG 
AACGAGCTGT 
CCGGAGCTAC 
TATCTGATGT 
TAGCTGAAGT 
GAATTTGTTT 
GAAACAAAAT 
ACCCTCTCTG 
TACTCACCCT 
T CATTAATG A 
TCAAAGCTGA 
TAGTATACGG 
AGTGCCGTTC 
CGGCTCGGGT 
GTGACCCTGT 
GTGTGGGTGC 
TTCCGGGACG 
TTCGCCCTGC 
NNNNCGGACC 
CAATAGCATC 
GTCCAAACTC 
GGATCGATNN 
CGTTGCTGGC 
CAAGTCAGAG 
GCTCCCTCGT 
TCCCTTCGGG 



CGGGTACCCG 
TCCTTGGGAG 
GCTCGTCCGG 
CTGGCCAAGA 
TTGTCAACCT 
CTAAAAGGGC 
CCAGGACATC 
CACCGCTCTC 
CCCACCGCCA 
GCCCCAAAAA 
CCTCTTAAGA 
CAAAATGAAA 
GTTTGACGAC 
ATGCAGCGGA 
AGGCAAGACG 
CTCACCTAGC 
CAGTTCTTGT 
CACCTTGCTT 
TCAGCTCCTA 
AGCCCCCATC 
CCAAAAAAGG 
AGCCCTGCCC 
TACCACTTTT 
TTTAAAACTA 
AGCAGACTCC 
GCAGTTCTCC 
CTTAGGTGCA 
TGCCCTAAAC 
AAACTGGACC 
GGATGAGCCA 
AC AGTTC ATC 
AGGCCTAGGT 
CCAAGTCTTA 
AGTTCTCCAA 
AGC C TT AC AA 
AAGAACCCTA 
GACCGGGCTG 
CCTACTCATA 
TAGACTTAAT 
GGAAGAAGCT 
CAT AGT A T AA 
CGGTGCTCAC 
TCTCCCGGGA 
TCATCAGCGC 
GCGGCCTGGA 
CCTCCGGGCC 
GCGACCCGGC 
GGTCGACTTG 
ACAAATTTCA 
ATCAATGTAT 
NNACATGTGA 
GTTTTTC CAT 
GTGGCGAAAC 
GCGCTCTCCT 
AAGCGTGGCG 



TGTATCCAAT 
GGTCTCCTCA 
G AT CTGGAG A 
TCCCCCGGGC 
TGCTCAATTC 
TGCAAGGCGT 
CACAAACTAG 
AAGGATTGGA 
TAAAGGTTGA 
CCCCTGGACC 
TAAGACTCTC 
CTCCCAACAG 
CCCCGCAAGG 
GGGCAGGTAT 
GCCTACTTAA 
GGGGGAGAAC 
TATACTGAAT 
AAAATACGGT 
CAGTCCCCTT 
CAT ATC TC C G 
CTAGAACAAA 
AAAGTCAGAG 
AGGTTACTCC 
GGTACCCCTA 
CTAGCGAATG 
AACTCGTCCT 
GTCACCTTTA 
GGGTCAGTCT 
AGACTTTCCG 
GTCCCCATTC 
CCTTTACTAG 
GTCTCCGTCA 
TCCGGT AC CA 
AATAGGAGGG 
GAAAAATGCT 
CAAGAAGAAT 
CAGGGCTTTC 
CTAACCATTG 
GTTGTACATG 
CAGGATTGAG 
TACGACTCAC 
CGCGCGCGAC 
CTTCGTGGAG 
GGTCCAGGAC 
CGAGCTGTAC 
GGCCATGACC 
CGGCAACTGC 
TTAACTTGTT 
CAAATAAAGC 
CTTATCATGT 
GCAAAAGGCC 
AGGCTCCGCC 
CCGACAGGAC 
GTTCCGACCC 
CTTTCTCAAT 



AAATCCTCTT 
GAGTGATTGA 
CCCCTGCCCA 
TGCAGGAATT 
CTTCTCCCCC 
GCAGGCCCAA 
CCACCCATTT 
GCCTCGTTGG 
CGGGATCGCC 
AGAAACTCCC 
CCGTGTCTGA 
GAATGGTCAT 
CTATCGCATT 
CCGAGGCCCC 
TGACCAACCA 
TC C AG AACTG 
AC CGGC AATG 
CTGGGAGCCT 
GTAGGGGCTC 
ATGGTGGAGG 
TTCATAAGGC 
ATGACCTTAG 
AGATGTCCAA 
CCCCTCTTGC 
CCTCCTGTCA 
GTTTATCTTC 
CTAACTGCAC 
TCCTCTGTGG 
TCCAAGCCTC 
CTGCCATTGA 
CTGGACTGGG 
CCCAGTATAC 
TACAAGATTT 
GACTGGACCT 
GTTTTTATGC 
T AC AAAAAC G 
TTCCGTACCT 
GGCCATGCGT 
CCATGGTGCT 
GCGCCTAGTG 
TATAGGAGGG 
GTCGCCGGAG 
GACGACTTCG 
CAGGTGGTGC 
GCCGAGTGGT 
GAGATCGGCG 
GTGCACTTCG 
TATTGCAGCT 
ATTTTTTTCA 
CTGGATCCAG 
AGCAAAAGGC 
CCCCTGACGA 
TATAAAGATA 
TGCCGCTTAC 
GCTCACGCTG 



CAGGC GC C AT 
TTCGCTATTA 
GCCAGGGTTT 
ATTTGTTAAA 
GCTAAAACCA 
AGGGGGGAAT 
GCAAGGCATG 
GAAAACAGCT 
GGGGCCAAGA* 
CCCCAGATAT 
CCCAAGGACC 
TTCTGTTCGC 
CGCUCCAGTC 
GCTGTTGCAT 
CTACCCGTCT 
GGGACCACCG 
TATGAAATCC 
TCCGATCCTA 
ATCTGGACAC 
CAGGTGGGAG 
AAGGGACCTT 
GCCTGGATTC 
AAAACCTGGA 
CTGCTAATCC 
TTTATGTAGC 
AGTACAAAAA 
ACCGAACTCC 
AAAATGGAAA 
CCCCTGTAAC 
CAGGCGAATT 
CAACGAGGTA 
TATAAATCAG 
ACCCCTCGAT 
TATGACTCCT 
CCTTGATGCA 
TTTT AGC CTT 
GATACCCACT 
GATTATACCT 
CCCTTTCATT 
CTCTGTAGCC 
AAATAACATG 
CCTCCTCCCC 
TCATTATATA 
AATCAC C GCA 
AAAATTATCC 
AC AAGAC CAG 
ACT AAC GGCA 
TAACAAGTCA 
CAGGGAAAGC 
CCTACCTCTC 
TTTCAGTCGC 
GGCCCAGCAA 
TTGACAATTA 
CCACCATGGC 
CGGTCGAGTT 
CCGGTGTGGT 
CGGACAACAC 
CGGAGGTCGT 
AGCAGCCGTG 
TGGCCGAGGA 
TATAATGGTT 
CTGCATTCTA 
ATCTGGGCCC 
CAGGAACCGT 
GCATC AC AAA 
CCAGGCGTTT 
CGGATACCTG 
TAGGTATCTC 
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= SESSg SSSSS! iSSSS SSS 0 ^ 

ACAGAGTTCT SSXSS "EE «« 

TGCGCTCTGC TGAAGCCAGT TACCTTCcSa A^GAG^G G^TCTTC JScSS 4320 

CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC ACCESS? £2^^** 4380 

AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT rTrar£r»r£f ?S5° AGAAAA 44 4 0 

AACTCACGTT AAGGGATTTT GGTCATgJgI SKSSS SStotcap =£^ GAA 4 500 

TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT SSSJ? " 4560 

gcttaatcag tgaggcacct atctSSI SSSS tcg^?S^c 
^JTgcct gactccccgt cgtgtagata actacgatac gggagggctt IccItctgg? 
cccagtgctg caatgatacc gcgagaccca cgctcaccgg CTCCAGATTT ATCAr™*™ 

CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG SiSSS C^S^e" 
^TCTATTA ATTGT, TCCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAOT^GC 
™ JISII G CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG TA^G^C^rf 
TTCAGCTCCG GTTC CCAACG ATCAAGGCGA GTTACATGAT CCCCCATGT- Gr^fflT^t 
^a£™ JCTTCGGTCC TCCGATCGTT GTCAGAAGtI AG^G^Sc AGTGTTATCA 
CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCCT aaA»tv?P^ 
TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG ^r»^ f 16 ° 
TGCTCTTGCC CGGCGTCAAT ACGGGATAAT A^GcbEc^ ATAGCAGAAC TTTAAAAOTr 
CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC G^G^aJ! f?52 
T^^GGA TGTAACCCA <= TCGTGCACCC AACTGATCTT CAGCATCTTT 

AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGT AA^IaS^ f 4 °° 
ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAA? ^t^?^ 546 ° 

5S" A ^ GTC TCATGAGCGG ATACATATTT GaSSS? aS^S^ aStI^ 
ar™*^ CATTTCCCCG AAAAGTGCCA CCTGACGTCT AAGAAACCAT 7^T^ II! 2 

A A^^ CT ATAAAAATAG GCGTATCACG AGGCCCTTTC GTCTCGCGCG TTTCGGTGAT 
GACGGTGAAA AC CTCTGAC A CATGCAGCTC CCGGAGACGG TCACAGC^TG Trr^IS^ 
? A ! G £ CGGGA GCAGACAAGC CCGTCAGGGC GCGTCAGCgS ra^cS G^CG^ 
TGGCTTAACT ATGCGGCATC AGAGCAGATT GTACTGAGAG TGCAC GTGTCGGGGC 
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AGATCTCCCG 
AGCCAGTATC 
TAAGCTACAA 
CGTTTTGCGC 
AGTTATTAAT 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGACT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 



ATCCCCTATG 
TGCTCCCTGC 
CAAGGCAAGG 
TGCTTCGCGA 
AGTAATCAAT 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 



CGGTGGGAGG 
GCTTATCGAA 
CTTTTTCGCT 
TAGAATGGGA 
CTTTC TACTC 
TTCGTTAAAC 
AGATTGTAAG 
TACTTCAGCA 
GCATATAAAT 
CATCATCCTG 
AT AC T CTG AG 
ACAGCTCCTG 
GGAACAGCAT 
TTAACCCGTG 
GCCCCCATCA 
CCAATGCCAC 
TATGTGATCT 
GCTGCAAATA 
GGCATACCGT 
GTGAAACCAC 
AGCGCGGTAA 
ACCTCTCCAA 
TAGTCCTAGA 
GACTGAGACT 
TCCTCAATAT 
CCCCCTCCCG 
CAGCCTCTAT 
TGCTAAACCT 
AAGAATGTTG 
GCACTTATAC 
TTACCCTATC 
AGGCCTTATG 
CTGGAACAAT 
ATCTAACCAC 
C C G ATT AT AT 
TGACCCTGGC 
CGGGGACCAC 
CAGACCTCAA 
CTGAAGTAGT 
TCTGCGCAGC 
ACAGCATGGC 
AAGGTTGGTT 
TCATGGGACC 
GATTAGTTCA 
AATACCACCA 
TCATCGGCAT 
AAGTTGACCA 
TGGACCGACC 
CGGGACGACG 
CTGGCCTGGG 
TCCACGAACT 
GGGCGGGAGT 
CAGGACTGAN 



Art JAj IHjI'AA 

TCTATATAAG 
ATGTCGACTG 
ATTGTAAAAT 
AGATGTCCCT 
TGTTGACAAC 
TTTAGCTTGC 
TACTTTCTCT 
CAGTTT T AG A 
TCTGGC TGGC 
CCTTTCTCTT 
TCCAAACCGG 
GGCAACGTGC 
CAGGACCGAC 
GAAGTCCTTA 
GGTCTTTAAT 
CTCCCTTTTA 
GGTCGGAGAA 
CCCCGGAGGG 
AAAATCGGGG 
CGGACAGGCT 
CACCCCCTGG 
AGTATCCAAT 
ATTCACTGAT 
GTACCGGACA 
AGGGCCCCGC 
ACCCGTGCAG 
AGTCCCTGAG 
GGTAGAAGGA 
GCTGTGCTTA 
CAATCATTCT 
TGAAGTGACA 
TAACACCACC 
GTGGGCTTGT 
AGACTATTGT 
GTATGGTCAG 
CCTTCTGCTA 
TGCCCTAATC 
CGAAGTCGAA 
CCTACAGAAC 
CCTAAAAGAA 
CAAACTAAGG 
CGAAGGGCAG 
TCTAATAGTA 
AT TTGT T AAA 
GCTAAAGCCT 
AGT AT AC GGC 
GTGCCGTTCC 
GGCTCGGGTT 
TGACCCTGTT 
TGTGGGTGCG 
TCCGGGACGC 
TCGCCCTGCG 
MNNCGG AC C G 



GTCGACTCTC 
TTGTGTGTTG 
CTTGACCGAC 
TGTACGGGCC 
TACGGGGTCA 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCTC 
AGAACTTCAG 
TCATGTTATA 
TGTATCACCA 
CATTGTCTCC 
ATTTGTAACG 
AATCACTTTT 
GAACAATTGT 
GTGGAAATAT 
TATGGTTACA 
GCCCCTCTGC 
TGGTTGTTGT 
ATGGAAGGTC 
AT GGTCAT GG 
GTAACCTGGA 
GGAACTGTAC 
GAGTGGGACC 
AGAAAGCGGA 
TGTGGGGGGC 
TACTGGAAGC 
GACACGGGAT 
TCCTTCCAAG 
GCAGGAAAAA 
GG AACAGATC 
ATCCCCATTG 
ATCAGGCTCC 
ACTGCCCCAC 
GC CTATCAGG 
GTGTCGGGAC 
ACCGCCCCGG 
GGACAGGGCC 
CAAAGTGCCG 
AGCACTGGAT 
GTATTAGTTG 
CTTGAACAGC 
GGAGGATTAA 
AAAACCCAGC 
AAATC AATT A 
CGAAGAGGCC 
GAATGTTGTT 
GAAAGGCTTA 
TTTAATAGAT 
CTCTTACTGA 
GACAGGATCT 
ATAGAGTACG 
ATAGTATAAT 
GGTGCTCACC 
CTCCCGGGAC 
CATCAGCGCG 
CGGCCTGGAC 
CTCCGGGCCG 
CGACCCGGCC 
GTCGA 



AGTACAATCT 
GAGGTCGCTG 
AATTGCATGA 
AG AT AT AC GC 
TTAGTTCATA 
GGC TG AC C GC 
ACGCCAATAG 
TTGGCAGTAC 
AAA TGGC CCG 
TACATCTACG 
GGGCGTGGAT 
GGG AOTTTGT 
CCATTGACGC 
TGGCTAACTA 
GGTGAG TTTG 
TGGAGGGGGC 
TGGACCCTCA 
TCTTATTTTC 
AATTTTTAAA 
TTTTCAAGGC 
TATAATTAAA 
TCTTATTGGT 
ATGATATACA 
TAACCATGTT 
GCTGTCTCAT 
CAGCGTTCTC 
GGGTCTATTT 
GAGTCACCAA 
AAGATGCCTT 
CTTCAGACCA 
CCCGGACTTT 
CAAGAGAGGG 
CCACATCATC 
GCTCCAAAAT 
GGGCTACTCG 
AGGCTAATTG 
CTATTACCAT 
GGCCTAATCC 
CCAGGCCTCC 
CTTCTCAACA 
CGCTTAACCT 
CTCCTTATTA 
CCAGCTGTAC 
TATGCATGGG 
GCTCAGGATC 
TGACTCCCTG 
AGCTCTGGCC 
GTACCAAATA 
CCATGGGAGG 
AG TTTG AG CA 
CCAACCTAGA 
TAGATTTGCT 
TTTATGCAGA 
ATCAGAGACA 
CCCCCTGGTT 
TCTTACTCTT 
CAGTAGTCCA 
AGCCATAGGG 
ACGACTCACT 
GCGCGCGACG 
TTCGTGGAGG 
GTCCAGGACC 
GAGCTGTACG 
GCCATGACCG 
GGCAACTGCG 



GCTCTGATGC 
AGTAGTGCGC 
AGAATCTGCT 
G TTG ACATT G 
GCCCATATAT 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
C CTGGCATT A 
TATTAGTCAT 
AGCGGTTTGA 
TTTnr , r , ?icc - 
AAAT GGGCGG 
GAGAACCCAC 
GGGACCCTTG 
AAAGTTTTCA 
TGATAATTTT 
TTTTCATTTT 
TTCACTTTTG 
AATCAGGGTA 
TGATAAGGTA 
AGAAACAACT 
CTG TTTGAGA 
CATGCCTTCT 
CATTTTGGCA 
AAAACCC CTT 
AAGAGTAGGG 
CCTGATGACT 
CCCAAGATTA 
GGAAC CAT AT 
TGACTTTTAC 
CTACTGTGGT 
ATGGGACCTA 
GGCTTGTGGC 
AGGGGGCAGA 
GGACGGGCCC 
GTTCTCCCTG 
CGTGATCACT 
TCAGCCTCCT 
ACCTGGGACG 
CACCAATCCC 
C GAAGGAGT A 
GGCCACTTCC 
AGCACTACCT 
CTACTACCTT 
CTTGTCCACC 
CAGAATAATT 
TAAGAGGGAG 
GATTGCAGCT 
GCTTCACGCC 
AAAGTCAC TG 
CTTCCTAAAA 
CCACACGGGA 
AAAACTATTT 
TACCACCTTA 
TGGACCCTGC 
GGCTTTAGTC 
CGCCTAGTGT 
ATAGGAGGGC 
TCGCCGGAGC 
ACGACTTCGC 
AGGTGGTGCC 
CCGAGTGGTC 
AGATCGGCGA 
TGCACTTCGT 



CGCATAGTTA 

GAGCAAAATT 

TAGGGTTAGG 

ATTATTGACT 

GGAGTTCCGC 

CCGCCCATTG 

TTGACGTCAA 

TCATATGCCA . 

TGCCCAGTAC 

CGCTATTACC 

CTCACGGGGA 



TAGGCGTGTA 
TGCTTAACTG 
ATTGTTCTTT 
GGGTGTTGTT 
GTTTCTTTCA 
CTGTAACTTT 
TTTATTTGTC 
TATTATATTG 
GAATATTTCT 
ACATCCTGGT 
TGAGGATAAA 
TCTTTTTCCT 
AGGATCGGCC 
AAAGATAAGA 
ATGGCAGAGA 
GGGCGTACCG 
TATTTTGATC 
GTC GGGTATG 
GTGTGCCCTG 
GAATGGGGTT 
ATCTCCCTTA 
CCCTGCTACG 
TGCAACCCTC 
AAATCGTGGG 
ACCCGCCAGG 
GGTCAACTAC 
CCTACAGGCG 
GGAGACAGGC 
GACAAGACCC 
GCGGTCGTGG 
CAACATAAGC 
AAAACT CACC 
GCAGCACCCG 
ACGATGCTCA 
TACCACTCCC 
CCAGTATCGT 
GGAATAGGGA 
GCTATCCAGA 
ACCTCGTTGT 
GAGGGAGGTC 
CTAGTGAGAG 
GAGTCAGGCC 
ATCTCCACCA 
ATTCT CAATC 
C TG AC TCAAC 
T G AC AATT AA 
CACCATGGCC 
GGTC G AGTTC 
CGGTGTGGTC 
GGACAACACC 
GGAGGTCGTG 
GCAGCCGTGG 
GGCCGAGGAG 
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